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a-amylase mutants 

FIELD OF THE INVENTION 

The present invention relates to variants (mutants) of parent 

Termamyl-like a-amylases with higher activity at medium 
temperatures and/or high pH. 

BACKGROUND OF THE INVENTION 

a- Amylases (a-1 , 4-glucan-4-glucanohydrolases , EC 3.2.1.1) 
constitute a group of enzymes which catalyze hydrolysis of 
starch and other linear and branched 1 , 4-glucosidic oligo- and 
polysaccharides . 

There is a very extensive body of patent and scientific 
literature relating to this industrially very important class of 

enzymes. A number of a-amylases such as Termamyl-like a- 
amylases variants are known from e.g. WO 90/11352, WO 95/10603, 
WO 95/26397, WO 96/23873 and WO 96/23874. 

Among more recent disclosures relating to a-amylases, WO 
96/23874 provides three-dimensional, X-ray crystal structural 

data for a Termamyl-like a-amylase which consists of the 300 N- 
terminal amino acid residues of the B. amyloliquefaciens 

a-amylase (BAN™) and amino acids 301-483 of the C-terminal end 

of the B. lichen! formis a-amylase comprising the amino acid 
sequence (the latter being available commercially under the 

tradename Termamyl™) , and which is thus closely related to the 

industrially important Bacillus a-amylases (which in the present 
context are embraced within the meaning of the term "Termamyl- 
like a-amylases", and which include, inter alia, the B. 
licheniformis , B. amyloliquefaciens (BAN™) and B. 

stearothermophilus (BSG™ ) a-amylases) . WO 96/23874 further 
describes methodology for designing, on the basis of an analysis 

of the structure of a parent Termamyl-like a-amylase, variants 

of the parent Termamyl-like a-amylase which exhibit altered 
properties relative to the parent. 
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BRIEF DISCLOSURE OF THE INVENTION 

The present invention relates to novel a-amylolytic variants 

(mutants) of a Termamyl-like a-amylase which exhibit improved 

wash performance (relative to the parent ot-amaylase) at high pH 
and at a medium temperature . 

. The term ''medium temperature" means in the context of the 

invention a temperature from 10°C to 60°C, preferably 20°C to 

50°C, especially 30-40°C. 

The term "high pH" means the alkaline pH which is today used 
for washing, more specifically from about pH 8 to 10.5. 

In the context of the invention a "low temperature cc- 
amylase" means an a-amylase which has a relative optimum 
activity in the temperature range from 0-30°C. 

In the context of the invention a "medium temperature a- 
amylase" means an a-amylase which has an optimum activity in the 
temperature range from 30-60°C. For instance, SP690 and SP722 a- 
amylases, respectively, are "medium temperature a-amylases. 

In the context of the invention a "high temperature a- 

amylase" is an a-amylase having the optimum activity in the 

temperature range from 60-110°C. For instance, Termamyl is a 

"high temperature a-amylase. 

Alterations in properties which may be achieved in variants 
(mutants) of the invention are alterations in: 

The stability of the Termamyl-like a-amylase at a pH from 8 to 
10.5, and/or the Ca 2+ stability at pH 8 to 10.5, and/or the 

specific activity at temperatures from 10 to 60°C, preferably 20- 

50°C, especially 30-40°C. 

It should be noted that the relative temperature optimum is 
often dependent on the specific pH used. In other words the 
relative temperature optimum determined at, e.g. pH 8, may be 
substantially different from the relative temperature optimum 
determined at, e.g., pH 10. 

The temperature' s influence on the enzymatic activity 
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The dynamics in the active site and surroundings are 
dependent on the temperature and the amino acid composition and 
of strong importance for the relative temperature optimum of an 
enzyme. By comparing the dynamics of medium and high temperature 

a-amylases, regions of importance for the function of high 

temperature oc-amylases at medium temperatures can be determined. 

The temperature activity profile of the SP722 oc-amaylase (SEQ ID 

NO: 2) and the B. licheniformis ot-amylase (available from Novo 
Nordisk as Termamyl®) (SEQ ID NO: 4) are shown in Figure 2. 

The relative temperature optimum of SP722 in absolute 
activities is shown to be higher at medium range temperatures 

(30-60°C) than the homologous B . licheniformis oc-amylase, which 

has an optimum activity around 60-100°C. The profiles are mainly 
dependent on the temperature stability and the dynamics of the 
active site residues and their surroundings. Further , the 
activity profiles are dependent on the pH used and the pKa of 
the active site residues. 

In the first aspect the invention relates to a variant of a 

parent Termamyl-like oc-amylase, which variant has a-amylase 
activity, said variant comprises one or more mutations 
corresponding to the following mutations in the amino acid 
sequence shown in SEQ ID NO: 2: 

T141, K142, F143, D144, F145, P146, G147, R148, G149, 
Q174, R181, G182, D183, G184, K185, A186, W189, S193, N195, 
H107, K108, G109,D166, W167, D168, Q169, S170, R171, Q172, F173, 
F267, W268, K269, N270, D271, L272, G273, A274, L275, K311, 
E346, K385, G456, N457, K458, P459, G460, T461, V462, T463. 

A variant of the invention have one or more of the following 
substitutions or deletions: 

T141A, D,R,N,C,E,Q,G,H, I,L, K,M, F, P, S,W, Y,V; 
K14 2A, D,R,N,C,E,Q,G,H, I,L,M, F, P, S,T,W, Y, V; 
F14 3A, D,R,N,C,E,Q,G,H, I,L,K,M,P,S,T,W,Y,V; 
D14 4A,R,N,C,E,Q,G,H, I, L, K, M, F, P, S, T, W, Y, V; 
F14 5A, D,R,N,C,E,Q,G,H, I,L,K,M, P, S,T,W, Y, V; 
P14 6A, D,R,N,C,E,Q,G,H, I,L,K,M, F, S, T,W, Y, V; 
G147A, D, R,N,C, E f Q, H, I,L,K,M, F, P, S, T, W, Y, V; 
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R14 8A, D,N,C,E,Q,G,H, I , L, K,M, F, P,S,T,W,Y,V; 
G14 9A, D,R,N,C,E,Q,H, I,L,K,M, F,P,S,T,W, Y,V; 
R181*, A, D,N,C,E,Q,G,H, I,L,K,M, F, P, S , T, W, Y, V; 
G182*,A,D,R,N,C,E,Q,H, I,L,K,M, F, P, S , T , W, Y, V; 
D18 3*,A,R,N,C,E,Q,G,H, I,L,K,M,F, P, S , T , W, Y, V; 
G184*,A,R,D,N,C,E,Q,H, I,L,K,M,F, P,S,T,W,Y,V; 
K18 5A,D,R,N,C,E,Q,G,H, I,L,M, F, P, S , T , W, Y, V; 
A18 6D,R,N,C,E,Q,G,H, I,L,K,M, F,P,S,T,W,Y,V; 
W18 9A,D,R,N,C,E,Q,G,H, I,L,K,M, F, P, S, T, Y, V; 
S193A, D,R,N,C,E,Q,G,H, I,L,K,M, F, P,T,W, Y,V; 
N195A, D,R,C,E,Q,G,H, I,L,K,M, F, P, S,T,W, Y,V; 
H107A, D,R,N,C,E,Q,G,I,L,K,M,F,P,S,T,W,Y,V; 
K108A, D,R,N,C,E,Q,G,H, I, L, M, F,P,S,T,W,Y,V; 
G10 9A,D,R,N,C,E,Q,H, I, L, K,M, F, P, S,T, W, Y, V; 

D166A,R,N,C,E,Q,G,H, I , L, K, M, F, P, S , T, W, Y, V; 
W167A, D,R,N,C,E,Q,G,H,I,L,K,M,F,P,S,T,Y,V; 

D168A,R,N,C,E,Q,G,H, I,L,K,M, F, P, S , T, W, Y, V; 

Q169A, D,R,N,C,E,G,H, I,L,K,M, F, P, S , T , W , Y , V ; 

S17 0A,D,R,N,C,E,Q,G,H, I,L,K,M, F, P,T,W, Y,V; 

R171A,D,N,C,E,Q,G,H, I,L,K,M, F, P, S, T, W, Y, V; 

Q172A,D,R,N,C,E,G,H, I,L,K,M, F, P, S, T, W, Y, V; 

F17 3A, D,R,N,C,E,Q,G,H, I,L,K,M, P,S,T,W,Y,V; 

Q17 4*,A, D,R,N,C,E,G,H, I , L , K , M , F, P,S,T,W,Y,V; 

F267A,D,R,N,C,E,Q,G,H, I,L,K,M, P,S,T,W, Y,V; 

W268A, D,R,N,C,E,Q,G,H, I,L,K,M, F, P, S,T, Y, V; 

K2 69A, D,R,N,C,E,Q,G,H, I,L,M, F, P, S , T , W, Y, V; 

N27 0A, D,R, C,E,Q,G,H, I , L , K , M , F, P, S,T,W, Y,V; 

D271A,R,N,C,E,Q,G,H, I,L,K,M, F,P,S,T,W,Y,V; 

L272A, D,R,N,C,E,Q,G,H,I,K,M,F,P,S,T,W,Y,V; 

G27 3A, D, R,N,C,E,Q,H, I, L,K,M, F, P, S , T , W, Y, V; 
A274D, R, N , C, E, Q, G, H, I , L, K, M, F,P,S,T,W,Y,V; 

L27 5A, D,R,N,C,E,Q,G,H,I,K,M,F,P,S,T,W,Y,V; 

K311A, D,R,N,C,E,Q,G,H, I,L,M,F, P,S,T,W,Y,V; 

E34 6A, D,R,N,C,Q,G,H, I,K,L,M,F, P,S,T,W,Y,V; 

K385A, D,R,N,C,E,Q, G,H, I , L , M , F, P, S , T , W , Y , V ; 

G4 56A, D,R,N,C,E,Q,H, I,L,K,M, F, P, S, T, W, Y, V; 

N4 57A, D,R,C,E,Q,G,H, I, L, K,M, F, P, S, T, W, Y,V; 
K4 58A, D,R,N,C,E,Q,G,H,I,L,M,F,P,S,T,W,Y,V; 
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P459A, D,R,N,C, E,Q,G,H, I,L,K,M, F, S,T,W, Y, V; 
G4 60A, D,R,N,C,E,Q,H, I , L, K, M, F, P, S , T, W, Y, V; 
T4 61A, D,R,N,C,E,Q,G,H f I , L, K, M, F, P, S,W, Y, V; 
V4 62A, D, R, N,C, E, Q,G, H r I, L, K, M, F, P, S f T, W, Y; 
5 T4 63A, D,R,N,C,E,Q,G,H, I , L, K, M, F, P, S , W, Y, V . 

Preferred are variants having one or more of the following 
substitutions or deletions: 

K142R; S193P; N195F; K269R,Q; N270Y,R, D; K311R; E346Q; K385R; 
K458R; P459T; T461P; Q174*; R181Q,N,S; G182T,S,N; D183*; G184*; 
10 K185A, R, D,C,E,Q,G,H, I,L,M,N, F , P, S , T , W, Y, V; A18 6T , S , N , I , V, R; 
189T, S,N,Q. 

Especially preferred are variants having a deletion in 
positions D183 and G184 and further one or more of the following 
substitutions or deletions: 
15 K142R; S193P; N195F; K269R,Q; N270Y,R,D; K311R; E346Q; K385R; 
K458R; P459T; T461P; Q174*; R181Q,N,S; G182T,S,N; 
K18 5A,R, D,C,E,Q,G,H, I,L,M,N, F, P,S,T,W, Y,V; A18 6T, S , N, I , V, R; 
W189T,S, N,Q. 

The variants of the invention mentioned above exhibits an 
20 alteration in at least one of the following properties relative 

to the parent oc-amylase: 

i) improved pH stability at a pH from 8 to 10.5; and/or 

ii) improved Ca 2+ stability at pH 8 to 10.5, and/or 

iii) increased specific activity at temperatures from 10 to 60°C, 

25 preferably 20-50°C, especially 30-40°C. Further, details will be 
described below. 

The invention further relates to DNA constructs encoding 
variants of the invention; to methods for preparing variants of 
the invention; and to the use of variants of the invention, 
30 alone or in combination with other enzymes, in various 
industrial products or processes, e.g., in detergents or for 
starch liquefaction . 

In a final aspect the invention relates to a method of 

providing a-amylases with altered pH optimum, and/or altered 
35 temperature optimum, and/or improved stability. 

Nomenclature 

In the present description and claims, the conventional one- 
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letter and three-letter codes for amino acid residues are used. 

For ease of reference, a-amylase variants of the invention are 
described by use of the following nomenclature: 
Original amino acid ( s ): position ( s ): substituted amino acid(s) 

According to this nomenclature, for instance the substitution 
of alanine for asparagine in position 30 is shown as: 

Ala30Asn or A30N 

a deletion of alanine in the same position is shown as: 

Ala30* or A30* 
and insertion of an additional amino acid residue, such as 
lysine, is shown as: 

Ala30AlaLys or A3 OAK 

A deletion of a consecutive stretch of amino acid residues, 
such as amino acid residues 30-33, is indicated as (30-33)* or 

A(A30-N33) . 

Where a specific a-amylase contains a "deletion" in 

comparison with other a-amylases and an insertion is made in 
such a position this is indicated as: 

*36Asp or *36D 
for insertion of an aspartic acid in position 36 
Multiple mutations are separated by plus signs, i.e.: 

Ala30Asp + Glu34Ser or A30N+E34S 

representing mutations in positions 30 and 34 substituting 
alanine and glutamic acid for asparagine and serine, 
respectively . 

When one or more alternative amino acid residues may be 
inserted in a given position it is indicated as 
A30N,E or 
A30N or A30E 

Furthermore, when a position suitable for modification is 
identified herein without any specific modification being 
suggested, it is to be understood that any amino acid residue 
may be substituted for the amino acid residue present in the 
position. Thus, for instance, when a modification of an alanine 
in position 30 is mentioned, but not specified, it is to be 
understood that the alanine may be deleted or substituted for 
any other amino acid, i.e., any one of: 
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R, N,D,A,C, Q,E, G,H, I,L,K,M, F, P, S,T,W, Y,V. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is an alignment of the amino acid sequences of six 

5 parent Termamyl-like a-amylases . The numbers on the extreme left 

designate the respective amino acid sequences as follows: 

1: SEQ ID NO: 2 

2 : Kaoamyl 

3: SEQ ID NO: 1 
10 4: SEQ ID NO: 5 

5: SEQ ID NO: 4 

6: SEQ ID NO: 3. 

Figure 2 shows the temperature activity profile of SP722 (SEQ 

ID NO: 2) (at pH 9) and B. licheniformis a-amylase (SEQ ID NO: 
15 4) (at pH 7 . 3) . 

Figure 3 shows the temperature profile for SP690 (SEQ ID NO: 

1), SP722 (SEQ ID NO: 2), B. licheniformis a-amylase (SEQ ID NO: 
4) at pH 10. 

Figure 4 is an alignment of the amino acid sequences of five 

20 a-amylases. The numbers on the extreme left designate the 
respective amino acid sequences as follows: 
1 : amy p__mo use 
2 : amyp_rat 

3: amyp_pig porcine pancreatic alpha-amylase (PPA) 
25 4 : amyp_human 

5: amy_altha A. haloplanctis alpha-amylase (AHA) 

DETAILED DISCLOSURE OF THE INVENTION 

The Termamyl-like a -amylase 

30 It is well known that a number of a-amylases produced by 

Bacillus spp. are highly homologous on the amino acid level. For 

instance, the B. licheniformis a-amylase comprising the amino 
acid sequence shown in SEQ ID NO: . 4 (commercially available as 

Termamyl™) has been found to be about 8 9% homologous with the B. 

35 amyloliquefaciens a-amylase comprising the amino acid sequence 
shown in SEQ ID NO: 5 and about 7 9% homologous with the B. 
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stearothermophilus a-amylase comprising the amino acid sequence 

shown in SEQ ID NO: 3. Further homologous a-amylases include an 

a-amylase derived from a strain of the Bacillus sp. NCIB 12289, 
NCIB 12512, NCIB 12513 or DSM 9375, all of which are described 

in detail in WO 95/26397, and the a-amylase described by 
Tsukamoto et al . , Biochemical and Biophysical Research 
Communications , 151 (1988), pp. 25-31, (see SEQ ID NO: 6). 

Still further homologous a-amylases include the a-amylase 
produced by the B. lichen! formis strain described in EP 0252666 

(ATCC 27811), and the a-amylases identified in WO 91/00353 and 
WO 94/18314. Other commercial Termamyl-like B. licheni formis 
a-amylases are comprised in the products Optitherm™ and 
Takatherm™ (available from Solvay) , Maxamyl™ (available from 
Gist-brocades/Genencor) , Spezym AA™ and Spezyme Delta AA™ 

(available from Genencor) , and Keistase™ (available from Daiwa) . 

Because of the substantial homology found between these a- 

amylases, they are considered to belong to the same class of a- 

amylases, namely the class of "Termamyl-like a-amylases". 

Accordingly, in the present context, the term "Termamyl-like 

a-amylase" is intended to indicate an a-amylase which, at the 

amino acid level, exhibits a substantial homology to Termamyl™, 

i.e., the B. licheni formis a-amylase having the amino acid 
sequence shown in SEQ ID NO: 4 herein. In other words, all the 

following a-amylases which has the amino acid sequences shown in 
SEQ ID NOS: 1, 2, 3, 4, 5, 6, 7 or 8 herein, or the amino acid 
sequence shown in SEQ ID NO: 1 of WO 95/26397 (the same as the 
amino acid sequence shown as SEQ ID NO: 7 herein) or in SEQ ID 
NO: 2 of WO 95/26397 (the same as the amino acid sequence shown 
as SEQ ID NO: 8 herein) or in Tsukamoto et al., 1988, (which 
amino acid sequence is shown in SEQ ID NO: 6 herein) are 

considered to be "Termamyl-like a-amylase" . Other Termamyl-like 

a-amylases are a-amylases i) which displays at least 60%, such 
as at least 70%, e.g., at least 75%, or at least 80%, e.g., at 
least 85%, at least 90% or at least 95% homology with at least 
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one of said amino acid sequences shown in SEQ ID NOS: 1-8 and/or 
ii) displays immunological cross-reactivity with an antibody 

raised against at least one of said a-amylases, and/or iii) is 
encoded by a DNA sequence which hybridizes to the DNA sequences 

encoding the above-specified a-amylases which are apparent from 
SEQ ID NOS: 9, 10, 11, or 12 of the present application (which 
encoding sequences encode the amino acid sequences shown in SEQ 
ID NOS: 1, 2, 3, 4 and 5 herein, respectively), from SEQ ID NO: 
4 of WO 95/26397 (which DNA sequence, together with the stop 
codon TAA, is shown in SEQ ID NO: 13 herein and encodes the 
amino acid sequence shown in SEQ ID NO: 8 herein) and from SEQ 
ID NO: 5 of WO 95/26397 (shown in SEQ ID NO: 14 herein), 
respectively . 

In connection with property i) , the "homology" may be 
determined by use of any conventional algorithm, preferably by 
use of the GAP progamme from the GCG package version 7.3 (June 
1993) using default values for GAP penalties, which is a GAP 
creation penalty of 3.0 and GAP extension penalty of 0.1, 
(Genetic Computer Group (1991) Programme Manual for the GCG 
Package, version 7, 575 Science Drive, Madison, Wisconsin, USA 
53711) . 

A structural alignment between Termamyl (SEQ ID NO: 4) and a 
Termamyl-like a-amylase may be used to identify equiva- 
lent/corresponding positions in other Termamyl-like a-amylases. 
One method of obtaining said structural alignment is to use the 
Pile Up programme from the GCG package using default values of 
gap penalties, i.e., a gap creation penalty of 3.0 and gap 
extension penalty of 0.1. Other structural alignment methods 
include the hydrophobic cluster analysis (Gaboriaud et al . , 
(1987), FEBS LETTERS 224, pp. 149-155) and reverse threading 
(Huber, T ; Torda, AE, PROTEIN SCIENCE Vol. 7, No. 1 pp. 142- 
149 (1998) . 

Property ii) of the a-amylase, i.e., the immunological 
cross reactivity, may be assayed using an antibody raised 
against, or reactive with, at least one epitope of the relevant 

Termamyl-like a-amylase. The antibody, which may either be 
monoclonal or polyclonal, may be produced by methods known in 
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the art, e.g., as described by Hudson et al., Practical 
Immunology, Third edition (1989) , Blackwell Scientific 
Publications. The immunological cross-reactivity may be determi- 
ned using assays known in the art, examples of which are Western 
5 Blotting or radial immunodiffusion assay, e.g., as described by 
Hudson et al., 1989. In this respect, immunological cross- 
reactivity between the ct-amylases having the amino acid 
sequences SEQ ID NOS : 1, 2, 3, 4, 5, 6, 7, or 8, respectively, 
has been found. 

10 The oligonucleotide probe used in the characterisation of the 

Termamyl-like a-amylase in accordance with property iii) above 
may suitably be prepared on the basis of the full or partial 

nucleotide or amino acid sequence of the a-amylase in question. 
Suitable conditions for testing hybridisation involve pre- 

15 soaking in 5xSSC and prehybridizing for 1 hour at -40 °C in a 
solution of 20% formamide, 5xDenhardt ! s solution, 50mM sodium 
phosphate, pH 6.8, and 50mg of denatured sonicated calf thymus 
DNA, followed by hybridisation in the same solution supplemented 
with lOOmM ATP for 18 hours at ~40°C, followed by three times 

20 washing of the filter in 2xSSC, 0.2% SDS at 40°C for 30 minutes 

(low stringency) , preferred at 50°C (medium stringency) , more 

preferably at 65°C (high stringency) , even more preferably at 

~75°C (very high stringency) . More details about the 
hybridisation method can be found in Sambrook et al., Molecular 
25 Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor, 1989. 

In the present context, "derived from" is intended not only 

to indicate an a-amylase produced or producible by a strain of 

the organism in question, but also an a-amylase encoded by a DNA 
sequence isolated from such strain and produced in a host or- 
30 ganism transformed with said DNA sequence. Finally, the term is 

intended to indicate an a-amylase which is encoded by a DNA 
sequence of synthetic and/or cDNA origin and which has the 

identifying characteristics of the a-amylase in question. The 

term is also intended to indicate that the parent a-amylase may 

35 be a variant of a naturally occurring a-amylase, i.e. a variant 
which is the result of a modification (insertion, substitution, 
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deletion) of one or more amino acid residues of the naturally 
occurring cc-amylase . 

Parent hybrid a -amylases 

The parent a-amylase (i.e., backbone a-amylase) may be a 

hybrid a-amylase, i.e., an a-amylase which comprises a combi- 
nation of partial amino acid sequences derived from at least two 

a-amylases . 

The parent hybrid a-amylase may be one which on the basis of 
amino acid homology and/or immunological cross-reactivity and/or 
DNA hybridization (as defined above) can be determined to belong 

to the Termamyl-like a-amylase family. In this case, the hybrid 

a-amylase is typically composed of at least one part of a 

Termamyl-like a-amylase and part(s) of one or more other a- 

amylases selected from Termamyl-like a-amylases or non-Termamyl- 

like a-amylases of microbial (bacterial or fungal) and/or 
mammalian origin. 

Thus, the parent hybrid a-amylase may comprise a combination 
of partial amino acid sequences deriving from at least two 

Termamyl-like a-amylases, or from at least one Termamyl-like and 

at least one non-Termamyl-like bacterial a-amylase, or from at 

least one Termamyl-like and at least one fungal a-amylase. The 

Termamyl-like a-amylase from which a partial amino acid sequence 

derives may, e.g., be any of those specific Termamyl-like a- 
amylase referred to herein. 

For instance, the parent a-amylase may comprise a C-terminal 

part of an a-amylase derived from a strain of B. licheniformis , 

and a N-terminal part of an a-amylase derived from a strain of 
B. amyloliquefaciens or from a strain of B. stearothermophilus . 

For instance, the parent a-amylase may comprise at least 430 
amino acid residues of the C-terminal part of the B. 

licheniformis a-amylase, and may, e.g., comprise a) an amino 
acid segment corresponding to the 37 N-terminal amino acid 
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residues of the B . amyloliquefaciens a-amylase having the amino 
acid sequence shown in SEQ ID NO: 5 and an amino acid segment 
corresponding to the 445 C-terminal amino acid residues of the 

B. licheniformis a-amylase having the amino acid sequence shown 
5 in SEQ ID NO: 4, or 

a hybrid Termamyl-like a-amylase being identical to the 

Termamyl sequence, i.e., the Bacillus licheniformis a-amylase 

shown in SEQ ID NO: 4, except that the N-terminal 35 amino acid 

residues (of the mature protein) has been replaced by the N- 

10 terminal 33 residues of BAN (mature protein), i.e., the 

Bacillus amyloliquefaciens a-amylase shown in SEQ ID NO: 5; or 
b) an amino acid segment corresponding to the 68 N-terminal 

amino acid residues of the B . stearothermophilus a-amylase 
having the amino acid sequence shown in SEQ ID NO: 3 and an 
15 amino acid segment corresponding to the 415 C-terminal amino 

acid residues of the B. licheniformis a-amylase having the amino 
acid sequence shown in SEQ ID NO: 4. 

Another suitable parent hybrid a-amylase is the one 
previously described in WO 96/23874 (from Novo Nordisk) 
20 constituting the N-terminus of BAN, Bacillus amyloliquefaciens 

a-amylase (amino acids 1-300 of the mature protein) and the C- 
terminus from Termamyl (amino acids 301-483 of the mature 
protein) . Increased activity was achieved by substituting one or 

more of the following positions of the above hybrid a-amylase 
25 (BAN: 1-300 /Termamyl : 301-483) : Q360, F290, and N102. Particularly 
interesting substitutions are one or more of the following 
substitutions: Q360E, D; F2 90A,C, D, E, G, H, I , K, L, M, N, P, Q, R, S, T; 
N102D,E; 

The corresponding positions in the SP722 a-amylase shown in 
30 SEQ ID NO: 2 are one or more of: S365, Y295, N106. Corresponding 

substitutions of particular interest in said a-amylase shown in 
SEQ ID NO: 2 are one or more of: S365D,E; Y295 
A,C,D,E,G,H,I,K,L,M,N,P,Q,R,S,T; and N106D,E. 

The corresponding positions in the SP690 a-amylase shown in 
35 SEQ ID NO: 1 are one or more of: S365, Y295, N106. The 
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corresponding substitutions of particular interest are one or 
more of: S365D,E; Y295 A,C,D,E / G,H,I,K,L,M,N,P,Q,R / S,T; N106D,E. 

The above mentioned non-Termamyl-like a-amylase may, e.g., be 

a fungal a-amylase, a mammalian or a plant a-amylase or a 

5 bacterial a-amylase (different from a Termamyl-like a-amylase) . 

Specific examples of such a-amylases include the Aspergillus 

oryzae TAKA a-amylase, the A. niger acid a-amylase, the Bacillus 

subtilis a-amylase, the porcine pancreatic a-amylase and a 

barley a-amylase. All of these a-amylases have elucidated 
10 structures which are markedly different from the structure of a 

typical Termamyl-like a-amylase as referred to herein. 

The fungal a-amylases mentioned above, i.e., derived from A. 
niger and A. oryzae, are highly homologous on the amino acid 
level and generally considered to belong to the same family of 

15 a-amylases. The fungal a-amylase derived from Aspergillus oryzae 

is commercially available under the tradename Fungamyl™. 

Furthermore, when a particular variant of a Termamyl-like a- 
amylase (variant of the invention) is referred to - in a 
conventional manner - by reference to modification (e.g., 
20 deletion or substitution) of specific amino acid residues in the 

amino acid sequence of a specific Termamyl-like a-amylase, it is 

to be understood that variants of another Termamyl-like a- 
amylase modified in the equivalent position (s) (as determined 
from the best possible amino acid sequence alignment between the 
25 respective amino acid sequences) are encompassed thereby. 

In a preferred embodiment of the invention the a-amylase 
backbone is derived from B. licheniformis (as the parent 

Termamyl-like a-amylase), e.g., one of those referred to above, 

such as the B. licheniformis a-amylase having the amino acid 
30 sequence shown in SEQ ID NO: 4. 

Altered properties of variants of the invention 

The following discusses the relationship between mutations 
which are present in variants of the invention, and desirable 



WO 99/23211 PCT/DK98/00471 

14 

alterations in properties (relative to those a parent Termamyl- 
like oc-amylase) which may result therefrom. 

Improved stability at pH 8-10.5 

In the context of the present invention, mutations (including 
amino acid substitutions) of importance with respect to 
achieving improved stability at high pH (i.e., pH 8-10.5) 
include mutations corresponding to mutations in one or more of 

the following positions in SP722 a-amylase (having the amino 
acid sequence shown in SEQ ID NO: 2) : T141, K142, F143, D144, 
F145, P146, G147, R148, G149, R181, A186, S193, N195, K269, 
N270, K311, K458, P459, T461. 

The variant of the invention have one or more of the 
following substitutions (using the SEQ ID NO: 2 numbering): 



T141A, 


D, 


R, 


N , 


c, 


E, 


Qr 


G, 


H, 


I, 


L, 


K, 


M, 


F, 




s, 


w, 


Y, 


V; 


K142A, 


D, 


R, 


N, 


c, 


E, 


Qr 


G, 


H, 


I, 


L, 


M, 


F, 


P, 


s, 


T , 


w, 


Y, 


V; 


F143A, 


D, 


R, 


N , 


c, 


E, 


Qr 


G, 


H, 


I, 


L, 


K, 


M, 


P, 


s, 


T, 


w, 


Y, 


V; 


D144A, 


R, 


N , 


c, 


E/ 


Qf 


G, 


H, 


I, 


L, 


K, 


M f 


F, 


P, 


s, 


T, 


w, 


Y, 


V; 


F145A, 


D, 


R/ 


N, 


c, 


E, 


Qf 


G, 


H, 


I, 


L, 


K, 


M, 


P, 


s, 


T , 


w, 


Y, 


V; 


P146A, 


D, 


R, 


N, 


c, 


E, 


Q, 


G, 


H, 


I, 


L, 


K, 


Mf 


F, 


s, 


T, 


w, 


Y, 


V; 


G147A, 


D, 


Rf 


N, 


c, 


E, 


Q, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T , 


w, 


Y, 


V; 


R148A, 


D, 


N, 


c, 


E, 


Q, 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T, 


w, 


Y, 


V; 


G149A, 


D, 


R, 


N, 


c, 


E, 


Q f 


H, 


I, 


L, 


- K, 


M, 


F, 


P, 


s, 


T , 


• w, 


Y, 


V; 


K181A, 


D, 


R f 


N, 


c, 


E, 


Q, 


G, 


H, 


I, 


- L, 


M, 


F, 


P, 


s, 


T, 


,w f 


Y, 


- V; 


A186D, 


R, 


N, 


c, 


E, 


Qi 


, G, 


H, 


I, 


L # 


> P# 


- K, 


M, 


F, 


s, 


T, 


' w # 


Y, 


- V; 


S193A, 


D, 


, R # 


N , 


,C, 




• Q, 


- G, 


-H, 


I, 


,L, 


- K, 


M, 


F, 


p, 




rW # 


Y| 


r V; 


N195A, 


D, 


, R, 


. C, 


, E, 


> Q< 




, H, 


' 1/ 


- L/ 






- F, 


P, 


• s, 


r T t 


r W, 


- Y / 


r V; 


K269A, 


- D / 




, N j 




> E, 




r G, 


, H, 


r I j 






f F, 


- P, 




f T, 


r W, 


r Y, 


r V; 


N270A, 


> D, 


f Rj 


, C, 


>Ei 








J, 


, L, 




, M, 


p F, 


' Pi 




rp 


r W, 


- Yi 


r V; 


K311A, 


- D, 


f R t 


,N 






rQ, 


P G J 






r L 4 


r M j 


r F, 


> Pi 






r W J 




f V; 


K458A, 


, D 


r Rj 


r N 


f c 


r E, 


rQ, 




r H, 


, I 


r L, 


P M ( 


r F, 


r P. 


r S 


r T 




r Y, 


f V; 


P459A, 


r D 


f R, 


r N 


f c 


r E 


,Q 


r G, 


r H, 


, I 




f K 


r M j 


r F, 


r S 


r T 


r w, 


r Y 


r V; 


T461A, 


r D 


r R 


r N 


,c 


r E 


,Q 


r G 


f H, 


r I 




f K 




r F, 


, p 


f s 




r Y 


r v. 



Preferred high pH stability variants include one or more of 

the following substitutions in the SP722 oc-amylase (having the 
amino acid sequence shown in SEQ ID NO: 2) : 

K142R, R181S, A186T, S193P, N195F, K269R, N270Y, K311R, K458R, 
P459T and T461P. 
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In specific embodiments the Bacillus strain NCIB 12512 a- 
amylase having the sequence shown in SEQ ID NO: 1, or the B. 

stearothermophllus a-amylase having the sequence shown in SEQ ID 

NO: 3, or the B. licheniformis a-amylase having the sequence 

shown in SEQ ID NO: 4, or the B. amyloliquefaciens a-amylase 
having the sequence shown in SEQ ID NO: 5 is used as the 

backbone, i.e., parent Termamyl-like a-amylase, for these 
mutations . 

As can been seen from the alignment in Figure 1 the B. 
stearothermophllus a-amylase already has a Tyrosine at position 
corresponding to N270 in SP722. Further, the Bacillus strain 

NCIB 12512 a-amylase, the B. stearothermophllus a-amylase, the 

B. licheniformis a-amylase and the B . amyloliquefaciens a- 
amylase already have Arginine at position corresponding to K458 

in SP722. Furthermore, the B. licheniformis a-amylase already 
has a Proline at position corresponding to T461 in SP722. 

Therefore, for said a-amylases these substitutions are not 
relevant . 

a-amylase variants with improved stability at high pH can be 
constructed by making substitutions in the regions found using 
the molecular dynamics simulation mentioned in Example 2. The 
simulation depicts the region (s) that has a higher flexibility 
or mobility at high pH (i.e., pH 8-10.5) when compared to medium 
pH. 

By using the structure of any bacterial alpha-amylase with 

homology (as defined below) to the Termamyl-like a-amylase 
(BA2) , of which the 3D structure is disclosed in Appendix 1 of 
WO 96/23874 (from Novo Nordisk) , it is possible to modelbuild 
the structure of such alpha-amylase and to subject it to 
molecular dynamics simulations. The homology of said bacterial 

a-amylase may be at least 60%, preferably be more than 70%, more 
preferably more than 80%, most preferably more than 90% 

homologous to the above mentioned Termamyl-like a-amylase (BA2) , 
measured using the UWGCG GAP program from the GCG package 
version 7.3 (June 1993) using default values for GAP penalties 
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[Genetic Computer Group (1991) Programme Manual for the GCG 
Package, version 7 , 575 Science Drive, Madison, Wisconsin, USA 
53711] . Substitution of the unfavorable residue for another 
would be applicable. 

Improved Ca 2+ stability at pH 8-10.5 

Improved Ca 2+ stability means the stability of the enzyme 
under Ca 2+ depletion has been improved. In the context of the 
present invention, mutations (including amino acid 

substitutions) of importance with respect to achieving improved 
Ca 2+ stability at high pH include mutation or deletion in one or 
more positions corresponding to the following positions in the 

SP722 a-amylase having the amino acid sequence shown in SEQ ID 
NO: 2: R181, G182, D183, G184, K185, A186, W189, N195, N270, 
E346, K385, K458, P459. 

A variant of the invention have one or more of the following 
substitutions or deletions: 



R181*, 


A, 


D, 


N, 


c, 


E, 


Q/ 


G, H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T , 


w, 


Y,V 


G182*, 


A, 


D, 


R, 


N , 


c, 


E, 


Qf H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T, 


w, 


Y,V 


D183*, 


A, 


R, 


N, 


c, 


E, 


Q, 


G, H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T, 


w, 


Y,V 


G184*, 


A, 


R, 


D, 


N, 


c, 


E, 


Q,H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T, 


w, 


Y, V 


K185A, 


D, 


R, 


N, 


c, 


E, 


G, 


G, H, 


I, 


. L, 


M, 


F, 


P, 


S, 


T, 


W, 


Y, 


V; 


A186D, 


R, 


N, 


c, 


E, 


Q, 


G, 


H, I, 


L, 


, K, 


M, 


F, 


P f 


s, 


T , 


W, 


Y, 


V; 


W189A, 


D, 


R, 




,C, 


E, 


Q, 


G, H, 


I, 


r L, 


K, 


-M, 


F, 


P, 


s, 


- T, 


- Y, 


V; 


N195A, 


D, 


Ri 


c, 


r E, 


• Q, 


,G, 


•H,I, 


L, 


, K, 


M, 


- F, 


P, 


Si 


T , 


,W, 


, Y , 


. V; 


N270A, 


, R, 


D, 


-N, 


X, 


r E i 


' Q; 


■H,I # 


L i 


- K, 


J, 


' F, 


P, 


s, 


, T j 


J, 


f Y , 


-V; 


E346A, 




. D, 


•N, 






, G, 


rH,I, 


• 


r K, 


r M, 


> F# 


P, 


> s, 


, T , 


J, 


> Y , 


rV; 


K385A, 


f R< 


> D, 


,Nj 


f c, 




rQ, 


, G, H, 


- I. 


r L ) 


,M, 


* F, 


' Pi 


' s, 


, T , 


J, 


r Y { 


,V; 


K458A, 


f Ri 


> D, 


r N, 




r E, 




r G, H, 


, I 




, M i 


r F, 


• Pi 


, S, 


r T , 




J, 


rV; 


P459A, 


r Rj 








r E , 


rQ< 


r G,H, 


, I 




r K ( 


r M ( 


- F, 


r s, 


, T ( 




, Y 


r V. 



Preferred are variants having one or more of the following 
substitutions or deletions: 
R181Q,N; G182T,S,N; D183*; G184*; 

K185A,R, D,C,E, Q,G,H, I,L,M,N, F, P,S,T,W, Y,V; Al 8 6T , S , N, I , V; 
W189T,S,N,Q; N195F, N270R,D; E346Q; K385R; K458R; P459T. 

In specific embodiments the Bacillus strain NCIB 12512 <x- 
amylase having the sequence shown in SEQ ID NO: 1, or the B. 

amylollquefaciens a-amylase having the sequence shown in SEQ ID 
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NO: 5, or the B. licheniformis ot-amylase having the sequence 
shown in SEQ ID NO: 4 are used as the backbone for these 
mutations. 

As can been seen from the alignment in Figure 1 the B. 
5 licheniformis a-amylase does not have the positions 

corresponding to D183 and G184 in SP722. Therefore for said a- 
amylases these deletions are not relevant. 

In a preferred embodiment the variant is the Bacillus strain 

NCIB 12512 a-amylase with deletions in D183 and G184 and further 
10 one of the following substitutions: R181Q,N and/or G182T,S,N 
and/or D183*; G184* and/or 

K18 5A, R,D,C,E,Q,G,H,I,L,M,N,F,P,S,T,W,Y,V and/or A18 6T, S,N, I, V 
and/or W189T,S,N,Q and/or N195F and/or N270R,D and/or E346Q 
and/or K385R and/or K458R and/or P459T. 

15 

Increased specific activity at medium temperature 

In a further aspect of the present invention, important 
mutations with respect to obtaining variants exhibiting 

increased specific activity at temperatures from 10-60°C, 

20 preferably 20-50°C, especially 30-40°C, include mutations 
corresponding to one or more of the following positions in the 

SP722 a-amylase having the amino acid sequence shown in SEQ ID 
NO: 2 : 

H107, K108, G109, D166, W167, D168, Q169, S170, R171, Q172, 
25 F173, Q174, D183, G184, N195, F267, W268, K269, N270, D271, 
L272, G273, A274, L275, G456, N457, K458, P459, G460, T461, 
V462, T463. 

The variant of the invention have one or more of the 

following substitutions : 
30 H107A, D,R,N,C,E, Q,G, I,L, K,M, F, P, S,T,W, Y,V; 

K108A, D,R,N,C,E,Q,G,H,I,L,M,F,P,S,T,W,Y,V; 

G109A, D,R,N,C,E,Q,H,I,L,K,M,F,P,S,T,W,Y,V; 

D166A,R,N,C,E,Q,G,H, I , L, K, M, F, P, S , T, W, Y, V; 

W167A, D,R,N,C,E,Q f G,H,I,L,K,M,F,P,S,T,Y f V; 
35 D168A, R, N f C, E, Q, G, H f I , L, K, M, F, P, S , T, W, Y , V; 

Q169A, D f R f N,C / E / G,H f I,L f K / M,F,P f S f T,W f Y / V; 
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15 



20 



25 
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S170A, 


D, 


Rf 


N, 


c, 


E, 


Qf 


G, 


H, 


I, 


L, 


K, 


M, 


F f 


P, 


T f 


w, 


Y, 


V; 


R171A, 




N, 


c, 


Ef 


Q, 


G, 


H f 


I, 


L, 


K f 


M, 


F, 


P, 


s, 


T, 


w, 


Yf 


V; 


Q172A, 


D, 


R, 


N, 


c, 


E, 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T, 


w f 


Yf 


V; 


F173A, 


D, 


Rf 


N, 


c, 


E, 


Qf 


G, 


H, 


I, 


L, 


K, 


M, 


P, 


Sf 


T f 




Yf 


V; 


Q174*, 


A, 




Rf 


N, 


c, 


E/ 


G, 


H, 


I, 




K, 


M f 


F, 
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Preferred variants has one or more of the following 
substitutions or deletions: Q174*, D183*, G184*, K269S. 

In a specific embodiment the B. licheniformis a-amylase 
having the sequence shown in SEQ ID NO: 4 is used as the 
backbone for these mutations. 



General mutations in variants of the invention: increased 
specific activity at medium temperatures 

The particularly interesting amino acid substitution are 
35 those that increase the mobility around the active site of the 
enzyme. This is accomplished by changes that disrupt stabilizing 
interaction in the vicinity of the active site, i.e., within 
preferably 10A or 8A or 6A or 4A from any of the residues 
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constituting the active site. 

Examples are mutations that reduce the size of side chains, 
such as 
Ala to Gly, 
Val to Ala or Gly, 
lie or Leu to Val, Ala, or Gly 
Thr to Ser 

Such mutations are expected to cause increased flexibility in 
the active site region either by the introduction of cavities 
or by the structural rearrangements that fill the space left by 
the mutation. 

It may be preferred that a variant of the invention comprises 
one or more modifications in addition to those outlined above. 
Thus, it may be advantageous that one or more Proline residues 
present in the part of the ot-amylase variant which is modified 
is/are replaced with a non-Proline residue which may be any of 
the possible, naturally occurring non-Proline residues, and 
which preferably is an Alanine, Glycine, Serine, Threonine, 
Valine or Leucine. 

Analogously, it may be preferred that one or more Cysteine 
residues present among the amino acid residues with which the 
parent oc-amylase is modified is/are replaced with a non-Cysteine 
residue such as Serine, Alanine, Threonine, Glycine, Valine or 
Leucine . 

Furthermore, a variant of the invention may - either as the 
only modification or in combination with any of the above 
outlined modifications - be modified so that one or more Asp 
and/or Glu present in an amino acid fragment corresponding to 
the amino acid fragment 185-209 of SEQ ID NO: 4 is replaced by 
an Asn and/or Gin, respectively. Also of interest is the 
replacement, in the Termamyl-like a-amylase, of one or more of 
the Lys residues present in an amino acid fragment corresponding 
to the amino acid fragment 185-209 of SEQ ID NO: 4 by an Arg. 

It will be understood that the present invention encompasses 
variants incorporating two or more of the above outlined 
modifications . 

Furthermore, it may be advantageous to introduce point- 
mutations in any of the variants described herein. 
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^-amylase variants having increased mobility around the active 
site : 

The mobility of ot-amylase variants of the invention may be 
increased by replacing one or more amino acid residue at one or 
more positions close to the substrate site. These positions are 
(using the SP722 a-amylase (SEQ ID NO: 2) numbering) : V56, K108, 
D168, Q169, Q172, L201, K269, L272, L275, K446, P459. 

Therefore, in an aspect the invention relates to variants 
being mutated in one or more of the above mentioned positions. 

Preferred substitutions are one or more of the following: 
V56A,G,S,T; 

K108A, D , E , Q , G , H , I,L,M,N,S,T,V; 
D168A, G, I, V,N, S,T; 
Q169A, D, G, H, I, L,M,N, S,T, V; 
Q17 2A, D, G, H, I, L,M, N, S, T, V; 
L201A, G, I , V, S , T ; 

K2 69A, D,E,Q,G, H, I,L,M, N, S,T, V; 
L272A,G, I, V, S,T; 
L275A,G, I,V,S,T; 

Y2 95A, D,E,Q,G,H,I,L,M,N,F,S,T,V; 
K4 4 6A, D, E, Q,G, H, I, L,M,N, S,T, V; 
P459A,G, I, L f S f T f V. 

In specific embodiments of the invention the Bacillus strain 

NCIB 12512 a-amylase having the sequence shown in SEQ ID NO: 1, 
or the B . stearothermophilus a-amylase having the sequence shown 
in SEQ ID NO: 3, or the B . licheniformis a-amylase having the 
sequence shown in SEQ ID NO: 4, or the B. amyloliquefaciens a- 
amylase having the sequence shown in SEQ ID NO: 5 are used as 
the backbone for these mutations. 

As can been seen from the alignment in Figure 1 the B. 
licheniformis a-amylase and the B . amyloliquefaciens a-amylase 
have a Glutamine at position corresponding to K269 in SP722. 
Further, the B . stearothermophilus a-amylase has a Serine at 
position corresponding to K269 in SP722. Therefore, for said a- 
amylases these substitutions are not relevant. 

Furthermore, as can been seen from the alignment in Figure 1 
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the B. amyloliquefaciens a-amylase has an Alanine at position 
corresponding to L272 in SP722, and the B . stearothermophilus a- 
amylase has a Isoleucine at the position corresponding to L272 
in SP722. Therefore, for said oc-amylases these substitutions are 
not relevant. 

As can been seen from the alignment in Figure 1, the Bacillus 
strain 12512 a-amylase has a Isoleucine at position 
corresponding to L275 in SP722. Therefore for said a-amylase 
this substitution is not relevant. 

As can been seen from the alignment in Figure 1 the B. 

amyloliquefaciens a-amylase has a Phenylalanine at position 
corresponding to Y295 in SP722. Further, the B. 
stearothermophilus a-amylase has an Asparagine at position 
corresponding to Y295 in SP722. Therefore, for said a-amylases 
these substitutions are not relevant. 

As can been seen from the alignment in Figure 1 the B. 
licheniformis a-amylase and the B. amyloliquefaciens a-amylase 
have a Asparagine at position corresponding to K446 in SP722. 
Further, the B. stearothermophilus a-amylase has a Histidine at 
position corresponding to K446 in SP722. Therefore, for said a- 
amylases these substitutions are not relevant. 

As can been seen from the alignment in Figure 1 the B. 
licheniformis a-amylase, the B . amyloliquefaciens a-amylase and 
the B. stearothermophilus a-amylase have a Serine at position 
corresponding to P459 in SP722. Further, the Bacillus strain 
12512 a-amylase has a Threonine at position corresponding to 
P459 in SP722. Therefore, for said a-amylases these 
substitutions are not relevant. 

Stabilization of enzymes having high activity at medium 
temperatures 

In a further embodiment the invention relates to improving 
the stability of low temperature a-amylases (e.g, Alteromonas 
haloplanctis (Feller et al., (1994), Eur. J. Biochem 222:441- 
447), and medium temperature a-amylases (e.g., SP722 and SP690) 
possessing medium temperature activity, i.e., commonly known as 
psychrophilic enzymes and mesophilic enzymes. The stability can 
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for this particular enzyme class be understood either as 
thermostability or the stability at Calcium depletion 
conditions . 

Typically, enzymes displaying the high activity at medium 
temperatures also display severe problems under conditions that 
stress the enzyme, such as temperature or Calcium depletion. 

Consequently, the objective is to provide enzymes that at the 
same time display the desired high activity at medium 
temperatures without loosing their activity under slightly 
stressed conditions . 

The activity of the stabilized variant measured at medium 
temperatures should preferably be between 100% or more and 50%, 
and more preferably between 100% or more and 70%, and most 
preferably between 100% or more and 85% of the original activity 
at that specific temperature before stabilization of the enzyme 
and the resulting enzyme should withstand longer incubation at 
stressed condition than the wild type enzyme. 

Contemplated enzymes include ot-amylases of, e.g., bacterial 
or fungal origin. 

An example of such a low temerature a-amylase is the one 
isolated from Alteromonas haloplanctis (Feller et al., (1994), 
Eur. J. Biochem 222:441-447). The crystal structure of this 
alpha-amylase has been solved (Aghajari et al., (1998), Protein 
Science 7 : 564-572) . 

The A. haloplanctis alpha-amylase (5 in alignment shown in 
Fig. 4) has a homology of approximately 66% to porcine 
pancreatic alpha-amylase (PPA) (3 in the alignment shown in Fig. 
4). The PPA 3D structure is known, and can be obtained from 
Brookhaven database under the name 10SE or 1DHK. Based on the 
homology to other more stable alpha amylases, stabilization of 
"the low temperature highly active enzyme" from Alteromonas 
haloplanctis alpha-amylase, can be obtained and at the same time 
retaining the desired high activity at medium temperatures. 

Figure 4 shown a multiple sequence alignments of five a- 
amylases, including the AHA and the PPA a-amylase. Specific 
mutations giving increased stability in Alteromonas haloplantis 
alpha-amylase : 

T66P, Q69P, R155P, Q177R, A205P, A232P, L243R, V295P, S315R. 
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Methods for preparing a -amylase variants 

Several methods for introducing mutations into genes are 
known in the art. After a brief discussion of the cloning of <x- 
amylase-encoding DNA sequences, methods for generating mutations 
at specific sites within the a-amylase-encoding sequence will be 
discussed . 

Cloning a DNA sequence encoding an a -amylase 

The DNA sequence encoding a parent a-amylase may be isolated 
from any cell or microorganism producing the a-amylase in 
question, using various methods well known in the art. First, a 
genomic DNA and/or cDNA library should be constructed using 
chromosomal DNA or messenger RNA from the organism that produces 
the a-amylase to be studied. Then, if the amino acid sequence of 
the a-amylase is known, homologous, labeled oligonucleotide 
probes may be synthesized and used to identify a-amylase- 
encoding clones from a genomic library prepared from the 
organism in question. Alternatively, a labeled oligonucleotide 
probe containing sequences homologous to a known a-amylase gene 
could be used as a probe to identify a-amylase-encoding clones, 
using hybridization and washing conditions of lower stringency. 

Yet another method for identifying a-amylase-encoding clones 
would involve inserting fragments of genomic DNA into an ex- 
pression vector, such as a plasmid, transforming a-amylase- 
negative bacteria with the resulting genomic DNA library, and 
then plating the transformed bacteria onto agar containing a 
substrate for a-amylase, thereby allowing clones expressing the 

a-amylase to be identified. 

Alternatively, the DNA sequence encoding the enzyme may be 
prepared synthetically by established standard methods, e.g., 
the phosphoroamidite method described by S.L. Beaucage and M.H. 
Caruthers (1981) or the method described by Matthes et al. 
(1984). In the phosphoroamidite method, oligonucleotides are 
synthesized, e.g., in an automatic DNA synthesizer, purified, 
annealed, ligated and cloned in appropriate vectors. 
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Finally, the DNA sequence may be of mixed genomic and synthe- 
tic origin, mixed synthetic and cDNA origin or mixed genomic and 
cDNA origin, prepared by ligating fragments of synthetic, 
genomic or cDNA origin (as appropriate, the fragments 
corresponding to various parts of the entire DNA sequence) , in 
accordance with standard techniques. The DNA sequence may also 
be prepared by polymerase chain reaction (PCR) using specific 
primers, for instance as described in US 4,683,202 or R.K. Saiki 
et al. (1988) . 

Expression of a -amylase variants 

According to the invention, a DNA sequence encoding the 
variant produced by methods described above, or by any alterna- 
tive methods known in the art, can be expressed, in enzyme form, 
using an expression vector which typically includes control 
sequences encoding a promoter, operator, ribosome binding site, 
translation initiation signal, and, optionally, a repressor gene 
or various activator genes. 

The recombinant expression vector carrying the DNA sequence 
encoding an a-amylase variant of the invention may be any vector 
which may conveniently be subjected to recombinant DNA 
procedures, and the choice of vector will often depend on the 
host cell into which it is to be introduced. Thus, the vector 
may be an autonomously replicating vector, i.e., a vector which 
exists as an extrachromosomal entity, the replication of which 
is independent of chromosomal replication, e.g., a plasmid, a 
bacteriophage or an extrachromosomal element, minichromosome or 
an artificial chromosome. Alternatively, the vector may be one 
which, when introduced into a host cell, is integrated into the 
host cell genome and replicated together with the chromosome ( s ) 
into which it has been integrated. 

In the vector, the DNA sequence should be operably connected 
to a suitable promoter sequence. The promoter may be any DNA 
sequence which shows transcriptional activity in the host cell 
of choice and may be derived from genes encoding proteins either 
homologous or heterologous to the host cell. Examples of 
suitable promoters for directing the transcription of the DNA 
sequence encoding an a-amylase variant of the invention, 
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sequence which shows transcriptional activity in the host cell 
of choice and may be derived from genes encoding proteins either 
homologous or heterologous to the host cell. Examples of 
suitable promoters for directing the transcription of the DNA 
sequence encoding an a-amylase variant of the invention, 
especially in a bacterial host, are the promoter of the lac 
operon of E.coli, the Streptomyces coelicolor agarase gene dagA 
promoters, the promoters of the Bacillus licheniformis a-amylase 
gene {amyL) , the promoters of the Bacillus stearothermophilus 
maltogenic amylase gene (amyM) , the promoters of the Bacillus 
amyloliquefaciens a-amylase (amyQ) , the promoters of the Ba- 
cillus subtilis xylA and xylB genes etc. For transcription in a 
fungal host, examples of useful promoters are those derived from 
the gene encoding A. oryzae TAKA amylase, Rhizomucor miehei 
aspartic proteinase, A. niger neutral a-amylase, A. niger acid 
stable a-amylase, A. niger glucoamylase, Rhizomucor miehei 
lipase, A. oryzae alkaline protease, A. oryzae triose phosphate 
isomerase or A. nidulans acetamidase. 

The expression vector of the invention may also comprise a 
suitable transcription terminator and, in eukaryotes, poly- 
adenylation sequences operably connected to the DNA sequence 
encoding the a-amylase variant of the invention. Termination and 
polyadenylation sequences may suitably be derived from the same 
sources as the promoter. 

The vector may further comprise a DNA sequence enabling the 
vector to replicate in the host cell in question. Examples of 
such sequences are the origins of replication of plasmids pUC19, 
pACYC177, pUBHO, pE194, pAMBl and pIJ702. 

The vector may also comprise a selectable marker, e.g. a gene 
the product of which complements a defect in the host cell, such 
as the dal genes from B . subtilis or B . licheniformis , or one 
which confers antibiotic resistance such as ampicillin, 
kanamycin, chloramphenicol or tetracyclin resistance. Fur- 
thermore, the vector may comprise Aspergillus selection markers 
such as amdS, argB, niaD and sC, a marker giving rise to 
hygromycin resistance, or the selection may be accomplished by 
co-transformation, e.g., as described in WO 91/17243. 
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While intracellular expression may be advantageous in some 
respects, e.g., when using certain bacteria as host cells, it is 
generally preferred that the expression is extracellular. In 
general, the Bacillus a-amylases mentioned herein comprise a 
preregion permitting secretion of the expressed protease into 
the culture medium. If desirable, this preregion may be replaced 
by a different preregion or signal sequence, conveniently accom- 
plished by substitution of the DNA sequences encoding the 
respective preregions . 

The procedures used to ligate the DNA construct of the inven- 
tion encoding an a-amylase variant, the promoter, terminator and 
other elements, respectively, and to insert them into suitable 
vectors containing the information necessary for replication, 
are well known to persons skilled in the art (cf., for instance, 
Sambrook et al., Molecular Cloning: A Laboratory Manual , 2nd 
Ed., Cold Spring Harbor, 1989). 

The cell of the invention, either comprising a DNA construct 
or an expression vector of the invention as defined above, is 
advantageously used as a host cell in the recombinant production 

of an a-amylase variant of the invention. The cell may be 
transformed with the DNA construct of the invention encoding the 
variant, conveniently by integrating the DNA construct (in one 
or more copies) in the host chromosome. This integration is 
generally considered to be an advantage as the DNA sequence is 
more likely to be stably maintained in the cell. Integration of 
the DNA constructs into the host chromosome may be performed 
according to conventional methods, e.g., by homologous or 
heterologous recombination. Alternatively, the cell may be 
transformed with an expression vector as described above in 
connection with the different types of host cells. 

The cell of the invention may be a cell of a higher organism 
such as a mammal or an insect, but is preferably a microbial 
cell, e.g. a bacterial or a fungal (including yeast) cell. 

Examples of suitable bacteria are Gram positive bacteria such 
as Bacillus subtilis , Bacillus licheniformis r Bacillus lentus, 
Bacillus brevis f Bacillus stearothermophilus , Bacillus alkalo- 
philusr Bacillus amyloliquefaciens, Bacillus coagulans , Bacillus 
circulans , Bacillus lautus r Bacillus megaterium, Bacillus 
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thuringiensis , or Streptomyces lividans or Streptomyces murinus, 
or gramnegative bacteria such as E.coli. The transformation of 
the bacteria may, for instance, be effected by protoplast trans- 
formation or by using competent cells in a manner known per se. 

The yeast organism may favorably be selected from a species 
of Saccharomyces or Schizosaccharomyces , e.g. Saccharomyces 
cerevisiae . The filamentous fungus may advantageously belong to 
a species of Aspergillus, e.g., Aspergillus oryzae or Aspergil- 
lus niger. Fungal cells may be transformed by a process involv- 
ing protoplast formation and transformation of the protoplasts 
followed by regeneration of the cell wall in a manner known per 
se. A suitable procedure for transformation of Aspergillus host 
cells is described in EP 238 023. 

In a yet further aspect, the present invention relates to a 

method of producing an a-amylase variant of the invention, which 
method comprises cultivating a host cell as described above 
under conditions conducive to the production of the variant and 
recovering the variant from the cells and/or culture medium. 

The medium used to cultivate the cells may be any convention- 
al medium suitable for growing the host cell in question and 

obtaining expression of the a-amylase variant of the invention. 
Suitable media are available from commercial suppliers or may be 
prepared according to published recipes (e.g. as described in 
catalogues of the American Type Culture Collection) . 

The a-amylase variant secreted from the host cells may con- 
veniently be recovered from the culture medium by well-known 
procedures, including separating the cells from the medium by 
centrif ugation or filtration, and precipitating proteinaceous 
components of the medium by means of a salt such as ammonium 
sulphate, followed by the use of chromatographic procedures such 
as ion exchange chromatography, affinity chromatography, or the 
like . 

Industrial Applications 

The a-amylase variants of this invention possesses valuable 
properties allowing for a variety of industrial applications. In 
particular, enzyme variants of the invention are applicable as a 
component in washing, dishwashing and hard-surface cleaning 
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detergent compositions . 

Numerous variants are particularly useful in the production 
of sweeteners and ethanol from starch, and/or for textile 
desizing. Conditions for conventional starch- conversion 
processes, including starch liquefaction and/or saccharif ication 
processes, are described in, e.g., US 3,912,590 and in EP patent 
publications Nos. 252,730 and 63,909. 

Detergent compositions 

As mentioned above, variants of the invention may suitably be 
incorporated in detergent compositions. Reference is made, for 
example, to WO 96/23874 and WO 97/07202 for further details 
concerning relevant ingredients of detergent compositions (such 
as laundry or dishwashing detergents) , appropriate methods of 
formulating the variants in such detergent compositions, and for 
examples of relevant types of detergent compositions. 

Detergent compositions comprising a variant of the invention 
may additionally comprise one or more other enzymes, such as a 
lipase, cutinase, protease, cellulase, peroxidase or laccase, 

and/or another a-amylase. 

a-amylase variants of the invention may be incorporated in 

detergents at conventionally employed concentrations. It is at 

present contemplated that a variant of the invention may be 

incorporated in an amount corresponding to 0.00001-1 mg 

(calculated as pure, active enzyme protein) of a-amylase per 
liter of wash/dishwash liquor using conventional dosing levels 
of detergent. 

The invention also relates to a method of providing a- 
amylases with 1) altered pH optimum, and/or 2) altered 
temperature optimum, and/or 3) improved stability, comprising 
the following steps: 

i) identifying (a) target position (s) and/or region (s) for 
mutation of the a-amylase by comparing the molecular dynamics of 

two or more a-amylase 3D structures having substantially 
different pH, temperature and/or stability profiles, 

ii) substituting, adding and/or deleting one or more amino acids 
in the identified position(s) and/or region(s). 
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In embodiment of the invention a medium temperature a- 

amylase is compared with a high temperature a-amylase. In 

another embodiment a low temperature a-amylase is compared with 

either a medium or a high temperature a-amylase. 

5 The a-amylases compared should preferably be at least 70% , 

preferably 80%, up to 90%, such as up to 95%, especially 95% 
homologous with each other. 

The oc-amylases compared may be Termamyl-like a-amylases as 

defined above. In specific embodiment the a-amylases compared 

10 are the a-amylases shown in SEQ ID NO: 1 to SEQ ID NO: 8. 

In another embodiment the stability profile of the a-amy- 
lases in question compared are the Ca 2+ dependency profile. 

MATERIALS AND METHODS 

15 Enzymes : 

SP722: (SEQ ID NO: 2, available from Novo Nordisk) 
Termamyl™ (SEQ ID NO: 4, available from Novo Nordisk) 
SP690: (SEQ ID NO: 1, available from Novo Nordisk) 

20 Bacillus subtilis SHA273: see WO 95/10603 

Plasmids 

pJEl contains the gene encoding a variant of SP722 a-amylase 
(SEQ ID NO: 2): viz. deletion of 6 nucleotides corresponding to 
25 amino acids D183-G184 in the mature protein. Transcription of 
the JE1 gene is directed from the amyL promoter. The plasmid 
further more contains the origin of replication and cat-gene 
conferring resistance towards kanamycin obtained from plasmid 
pUBHO (Gryczan, TJ et al. (1978), J. Bact . 134:318-329). 

30 

Methods : 

Construction of library vector pDorKlOl 

The E. coli/ Bacillus shuttle vector pDorKlOl (described 
below) can be used to introduce mutations without expression of 

35 a-amylase in E. coli and then be modified in such way that the 
a-amylase is active in Bacillus. The vector was constructed as 
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follows: The JE1 encoding gene (SP722 with the deletion of 
D183-G184) was inactivated in pJEl by gene interruption in the 
PstI site in the 5 "coding region of the SEQ ID NO: 2: SP722 by 
a 1.2 kb fragment containing an E. coll origin of replication. 
This fragment was PCR amplified from the pUC19 (GenBank 
Accession #:X02514) using the forward primer: 5'- 
gacctgcagtcaggcaacta-3 ' and the reverse primer: 5'- 
tagagtcgacctgcaggcat-3 ' . The PCR amplicon and the pJEl vector 

were digested with PstI at 37°C for 2 hours. The pJEl vector 
fragment and the PCR fragment were ligated at room temperature, 
for 1 hour and transformed in E . coll by electrotransf ormation . 
The resulting vector is designated pDorKlOl. 

Filter screening assays 

The assay can be used to screening of Termamyl-like a- 
amylase variants having an improved stability at high pH 

compared to the parent enzyme and Termamyl-like a-amylase 
variants having an improved stability at high pH and medium 
temperatures compared to the parent enzyme depending of the 
screening temperature setting 

High pH filter assay 

Bacillus libraries are plated on a sandwich of cellulose 
acetate (OE 67, Schleicher & Schuell, Dassel, Germany) - and 
nitrocellulose filters (Protran-Ba 85, Schleicher & Schuell, 

Dassel, Germany) on TY agar plates with 10 p,g/ml kanamycin at 

37°C for at least 21 hours. The cellulose acetate layer is 
located on the TY agar plate. 

Each filter sandwich is specifically marked with a needle 
after plating, but before incubation in order to be able to 
localize positive variants on the filter and the nitrocellulose 
filter with bound variants is transferred to a container with 
glycin-NaOH buffer, pH 8.6-10.6 and incubated at room 

temperature (can be altered from 10°-60°C) for 15 min. The 
cellulose acetate filters with colonies are stored on the TY- 
plates at room temperature until use. After incubation, 
residual activity is detected on plates containing 1% agarose, 
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0.2% starch in glycin-NaOH buffer, pH 8.6-10.6. The assay 
plates with nitrocellulose filters are marked the same way as 
the filter sandwich and incubated for 2 hours. at room 
temperature. After removal of the filters the assay plates are 
5 stained with 10% Lugol solution. Starch degrading variants are 
detected as white spots on dark blue background and then 
identified on the storage plates. Positive variants are 
rescreened twice under the same conditions as the first screen. 

10 Low calcium filter assay 

The Bacillus library are plated on a sandwich of 
cellulose acetate (OE 67, Schleicher & Schuell, Dassel, 
Germany) - and nitrocellulose filters (Protran-Ba 85, 
Schleicher & Schuell, Dassel, Germany) on TY agar plates with a 

15 relevant antibiotic, e.g., kanamycin or chloramphenicol, at 37°C 
for at least 21 hours. The cellulose acetate layer is located 
on the TY agar plate. 

Each filter sandwich is specifically marked with a needle 
after plating, but before incubation in order to be able to 

20 localize positive variants on the filter and the nitrocellulose 
filter with bound variants is transferred to a container with 
carbonate/bicarbonate buffer pH 8.5-10 and with different EDTA 
concentrations (0.001 mM - 100 mM) . The filters are incubated 
at room temperature for 1 hour. The cellulose acetate filters 

25 with colonies are stored on the TY-plates at room temperature 
until use. After incubation, residual activity is detected on 
plates containing 1% agarose, 0.2% starch in 

carbonate/bicarbonate buffer pH 8.5-10. The assay plates with 
nitrocellulose filters are marked the same way as the filter 

30 sandwich and incubated for 2 hours, at room temperature. After 
removal of the filters the assay plates are stained with 10% 
Lugol solution. Starch degrading variants are detected as white 
spots on dark blue background and then identified on the 
storage plates. Positive variants are rescreened twice under 

35 the same conditions as the first screen. 

Method for obtaining the regions of interest: 

There are three known 3D structures of bacterial a- 
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amylases. Two of B. licheniformis a-amylase, Brookhaven 
database 1BPL (Machius et al . (1995), J. Mol. Biol. 246, p. 
545-559) and 1VJS (Song et al. (1996), Enzymes for Carbohydrate 
163 Engineering (Prog. Biotechnol . V 12). These two structures 
are lacking an important piece of the structure from the so- 
called B-domain, in the area around the two Calcium ions and 
one Sodium ion binding sites. We have therefore used a 3D 

structure of an a-amylase BA2 (WO 96/23874 which are a hybrid 

between BAN™ (SEQ ID NO. 5) and B . licheniformis a-amylase (SEQ 
ID NO. 4). On basis of the structure a model of B. 

licheniformis alpha amylase and the SP722 a-amylase has been 
build. 



Fermentation and purification of a -amylase variants 

Fermentation and purification may be performed by methods 
well known in the art. 



Stability determination 

All stability trials are made using the same set up. The 
method are: 

The enzyme is incubated under the relevant conditions (1-4). 
Samples are taken at various time points, e.g., after 0, 5, 10, 
15 and 30 minutes and diluted 25 times (same dilution for all 
taken samples) in assay buffer (0.1M 50mM Britton buffer pH 
7.3) and the activity is measured using the Phadebas assay 
(Pharmacia) under standard conditions pH 7.3, 37 °C. 

The activity measured before incubation (0 minutes) is used 
as reference (100%) . The decline in percent is calculated as a 
function of the incubation time. The table shows the residual 
activity after, e.g., 30 minutes of incubation. 



Specific activity determination 
The specific activity is 
assay (Pharmacia) as activit 

instructions are followed (see 
amylase activity) . 



determined using the Phadebas 
y/mg enzyme. The manufactures 

also below under "Assay for a- 
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Assays for a-Amylase Activity 

1. Phadebas assay 

a-amylase activity is determined by a method employing 
Phadebas® tablets as substrate. Phadebas tablets (Phadebas® 
Amylase Test, supplied by Pharmacia Diagnostic) contain a cross- 
linked insoluble blue-colored starch polymer which has been 
mixed with bovine serum albumin and a buffer substance and 
tabletted. 

For every single measurement one tablet is suspended in a 
tube containing 5 ml 50 mM Britton-Robinson buffer (50 mM acetic 
acid, 50 mM phosphoric acid, 50 mM boric acid, 0.1 mM CaCl 2 , pH 
adjusted to the value of interest with NaOH) . The test is 

performed in a water bath at the temperature of interest. The a- 
amylase to be tested is diluted in x ml of 50 mM Britton- 
Robinson buffer. 1 ml of this a-amylase solution is added to the 
5 ml 50 mM Britton-Robinson buffer. The starch is hydrolyzed by 

the a-amylase giving soluble blue fragments. The absorbance of 
the resulting blue solution, measured spectrophotometrically at 

620 nm, is a function of the a-amylase activity. 

It is important that the measured 620 nm absorbance after 10 
or 15 minutes of incubation (testing time) is in the range of 
0.2 to 2.0 absorbance units at 620 nm. In this absorbance range 
there is linearity between activity and absorbance (Lambert-Beer 
law) . The dilution of the enzyme must therefore be adjusted to 
fit this criterion. Under a specified set of conditions (temp., 

pH, reaction time, buffer conditions) 1 mg of a given a-amylase 
will hydrolyze a certain amount -,of substrate and a blue colour 
will be produced. The colour intensity is measured at 620 nm. 
The measured absorbance is directly proportional to the specific 

activity (activity/mg of pure a-amylase protein) of the a- 
amylase in question under the given set of conditions. 

2. Alternative method 

a-amylase activity is determined by a method employing the 
PNP-G7 substrate. PNP-G7 which is a abbreviation for p- 

nitrophenyl-a, D-maltoheptaoside is a blocked oligosaccharide 
which can be cleaved by an endo-amylase . Following the cleavage, 
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the a-Glucosidase included in the kit digest the substrate to 
liberate a free PNP molecule which has a yellow colour and thus 

can be measured by visible spectophometry at A,= 405nm. (400-420 

nm. ) . Kits containing PNP-G7 substrate and a-Glucosidase is 
5 manufactured by Boehringer-Mannheim (cat. No. 1054635). 

To prepare the substrate one bottle of substrate (BM 

1442309) is added to 5 ml buffer (BM1442309) . To prepare the a- 

Glucosidase one bottle of a-Glucosidase (BM 1462309) is added to 
45 ml buffer (BM1442309) . The working solution is made by mixing 

10 5 ml a-Glucosidase solution with 0.5 ml substrate. 

The assay is performed by transforming 20]il enzyme solution 
to a 96 well microtitre plate and incubating at 25°C. 200 \il 
working solution, 25°C is added. The solution is mixed and pre- 
incubated 1 minute and absorption is measured every 15 sec. over 

15 3 minutes at OD 405 nm. 

The slope of the time dependent absorption-curve is directly 
proportional to the specific activity (activity per mg enzyme) 

of the a-amylase in question under the given set of conditions. 

20 General method for random mutagenesis by use of the DOPE 
program 

The random mutagenesis may be carried out by the following 
steps : 

1. Select regions of interest for modification in the parent 
25 enzyme 

2. Decide on mutation sites and non-mutated sites in the 
selected region 

3. Decide on which kind of mutations should be carried out, 
e.g. with respect to the desired stability and/or performance 

30 of the variant to be constructed 

4. Select structurally reasonable mutations. 

5. Adjust the residues selected by step 3 with regard to step 
4 . 

6. Analyze by use of a suitable dope algorithm the nucleotide 
35 distribution. 

7. If necessary, adjust the wanted residues to genetic code 
realism (e.g., taking into account constraints resulting from 
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the genetic code (e.g. in order to avoid introduction of stop 
codons) ) (the skilled person will be aware that some codon 
combinations cannot be used in practice and will need to be 
adapted) 

8 . Make primers 

9. Perform random mutagenesis by use of the primers 

10. Select resulting oc-amylase variants by screening for the 
desired improved properties. 

Suitable dope algorithms for use in step 6 are well known 
in the art. One algorithm is described by Tomandl, D. et al., 
Journal of Computer-Aided Molecular Design, 11 (1997), pp. 29- 
38) . Another algorithm, DOPE, is described in the following: 

The dope program 

The "DOPE" program is a computer algorithm useful to 
optimize the nucleotide composition of a codon triplet in such 
a way that it encodes an amino acid distribution which 
resembles most the wanted amino acid distribution. In order to 
assess which of the possible distributions is the most similar 
to the wanted amino acid distribution, a scoring function is 
needed. In the "Dope" program the following function was found 
to be suited: 



N 



f 



i=l 



X 



U"0 -*)'"'' J 



where the jc f . ' s are the obtained amounts of amino acids and 

groups of amino acids as calculated by the program, y. 1 s are 
the wanted amounts of amino acids and groups of amino acids as 
defined by the user of the program (e.g. specify which of the 
20 amino acids or stop codons are wanted to be introduced, e.g. 
with a certain percentage (e.g. 90% Ala, 3% lie, 7% Val) , and 

W; 1 s are assigned weight factors as defined by the user of the 
program (e.g., depending on the importance of having a specific 
amino acid residue inserted into the position in question) . N 
is 21 plus the number of amino acid groups as defined by the 
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user of the program. For purposes of this function 0° is 
defined as being 1. 

A Monte-Carlo algorithm (one example being the one 
described by Valleau, J. P. & Whittington, S.G. (1977) A guide 
5 to Mont Carlo for statistical mechanics: 1 Highways. In 
"Stastistical Mechanics, Part A" Equlibrium Techniqeues ed. 
B.J. Berne, New York: Plenum) is used for finding the maximum 
value of this function. In each iteration the following steps 
are performed: 

10 l.A new random nucleotide composition is chosen for each base, 
where the absolute difference between the current and the new 
composition is smaller than or equal to d for each of the 
four nucleotides G,A,T,C in all three positions of the codon 
(see below for definition of d) . 

15 2. The scores of the new composition and the current composition 
are compared by the use of the function s as described above. 
If the new score is higher or equal to the score of the 
current composition, the new composition is kept and the 
current composition is changed to the new one. If the new 

20 score is smaller, the probability of keeping the new 

composition is exp(\000(new score - current _score^ . 

A cycle normally consists of 1000 iterations as described 
above in which d is decreasing linearly from 1 to 0. One 
hundred or more cycles are performed in an optimization 
25 process. The nucleotide composition resulting in the highest 
score is finally presented. 

EXAMPLES 

30 EXAMPLE 1 

Example on Homology building of Termamyl™ 

The overall homology of the B. lichen! formis a-amylase (in 
the following referred to as Termamyl™) to other Termamyl-like 

a-amylases is high and the percent similarity is extremely high. 
35 The similarity calculated for Termamyl™ to BSG (the B . 

stearothermophilus a-amylase having SEQ ID NO: 3) , and BAN (the 
B. amyloliquefaciens a-amylase having SEQ ID NO: 5) using the 
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University of Wisconsin Genetics Computer Group's program GCG 
gave 89% and 78%, respectively. TERM has a deletion of 2 
residues between residue G180 and K181 compared to BAN™ and BSG. 
BSG has a deletion of 3 residues between G371 and 1372 in 
comparison with BAN™ and Termamyl™. Further BSG has a C-terminal 
extension of more than 20 residues compared to BAN™ and 
Termamyl™. BAN™ has 2 residues less and Termamyl has one residue 
less in the N-terminal compared to BSG. 

The structure of the B . licheniformis (Termamyl™) and of 

the B. amyloliquefaciens ot-amylase (BAN™) , respectively, was 
model built on the structure disclosed in Appendix 1 of WO 

96/23974. The structure of other Termamyl-like oc-amylases (e.g. 
those disclosed herein) may be built analogously. 

In comparison with the a-amylase used for elucidating the 
present structure, Termamyl™ differs in that it lacks two 
residues around 178-182. In order to compensate for this in the 
model structure, the HOMOLOGY program from BIOSYM was used to 
substitute the residues in equivalent positions in the structure 
(not only structurally conserved regions) except for the dele- 
tion point. A peptide bond was established between G179(G177) 
and K180(K180) in Termamyl™ (BAN™) . The close structural 
relationship between the solved structure and the model 
structure (and thus the validity of the latter) is indicated by 
the presence of only very few atoms found to be too close 
together in the model. 

To this very rough structure of Termamyl™ was then added 
all waters (605) and ions (4 Calcium and 1 Sodium) from the 
solved structure (See Appendix 1 of WO 96/23874) at the same 
coordinates as for said solved structure using the INSIGHT 
program. This could be done with only few overlaps - in other 
words with a very nice fit. This model structure were then 
minimized using 200 steps of Steepest descent and 600 steps of 
Conjugated gradient (see Brooks et al 1983, J. Computational 
Chemistry 4, p. 187-217). The minimized structure was then sub- 
jected to molecular dynamics, 5ps heating followed by up to 
200ps equilibration but more than 35ps. The dynamics as run with 
the Verlet algorithm and the equilibration temperature 300K were 
kept using the Behrendsen coupling to a water bath (Berendsen 
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et. al., 1984, J. Chemical Physics 81, p. 3684-3690). Rotations 
and translations were removed every pico second. 

EXAMPLE 2 

Method of extracting important regions for identifying a- 
amylase variants with improved pH stability and altered 
temperature activity 

The X-ray structure and/or the model build structure of 
the enzyme of interest, here SP722 and Termamyl™, are subjected 
to molecular dynamics simulations. The molecular dynamics 
simulation are made using the CHARMM (from Molecular 
simulations (MSI)) program or other suited program like, e.g., 
DISCOVER (from MSI). The molecular dynamic analysis is made in 
vacuum, or more preferred including crystal waters, or with the 
enzyme embedded in water, e.g., a water sphere or a water box. 
The simulation are run for 300 pico seconds (ps) or more, e.g., 
300-1200 ps. The isotropic fluctuations are extracted for the 
CA carbons of the structures and compared between the 
structures. Where the sequence has deletions and/or insertions 
the isotropic fluctuations from the other structure are 
inserted thus giving 0 as difference in isotropic fluctuation. 
For explanation of isotropic fluctuations see the CHARMM manual 
(obtainable from MSI) . 

The molecular dynamics simulation can be done using 
standard charges on the chargeable amino acids. This is Asp and 
Glu are negatively charged and Lys and Arg are positively 
charged. This condition resembles the medium pH of 
approximately 7. To analyze a higher or lower pH, titration of 
the molecule can be done to obtain the altered pKa' s of the 
standard titrateable residues normally within pH 2-10; Lys, 
Arg, Asp, Glu, Tyr and His. Also Ser, Thr and Cys are 
titrateable but are not taking into account here. Here the 
altered charges due to the pH has been described as both Asp 
and Glu are negative at high pH, and both Arg and Lys are 
uncharged. This imitates a pH around 10 to 11 where the 
titration of Lys and Arg starts, as the normal pKa of these 
residues are around 9-11. 
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1. The approach used for extracting important regions for 
identifying ot-amylase variants with high pH stability: 

The important regions for constructing variants with 
improved pH stability are the regions which at the extreme pH 
display the highest mobility, i.e., regions having the highest 
isotropic fluctuations . 

Such regions are identified by carrying out two molecular 
dynamics simulations: i) a high pH run at which the basic 
amino acids, Lys and Arg, are seen as neutral (i.e. not 
protonated) and the acidic amino acids, Asp and Glu, have the 
charge (-1) and ii) a neutral pH run with the basic amino 
acids, Lys and Arg, having the net charge of ( + 1) and the 
acidic amino acids having a charge of (-1) . 

The two run are compared and regions displaying the 
relatively higher mobility at high pH compared to neutral pH 
analysis were identified. 

Introduction of residues improving general stability, e.g., 
hydrogen bonding, making the region more rigid (by mutations 
such as Proline substitutions or replacement of Glycine 
residues) , or improving the charges or their interaction, 
improves the high pH stability of the enzyme. 

2. The approach used for extracting regions for identifying 

a-amylase variants with increased activity at medium 
temperatures : 

The important regions for constructing variants with 
increased activity at medium temperature was found as the 
difference between the isotropic fluctuations in SP722 and 
Termamyl, i.e., SP722 minus Termamyl CA isotrophic 
fluctuations, The regions with the highest mobility in the 
isotrophic fluctuations were selected. These regions and there 
residues were expected to increase the activity at medium 
temperatures. The activity of an alpha-amylase is only 
expressed if the correct mobility of certain residues are 
present. If the mobility of the residues is too low the 
activity is decreased or abandoned. 
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EXAMPLE 3 

Construction, by localized random, doped mutagenesis, of 
Termamyl-like a-amylase variants having an improved Ca2+ 
stability at medium temperatures compared to the parent enzyme 

5 To improve the stability at low calcium concentration of a- 

amylases random mutagenesis in pre-selected region was 
performed. 

Region: Residue: 
SA1: R181-W189 

10 The DOPE software (see Materials and Methods) was used to 

determine spiked codons for each suggested change in the SA1 
region minimizing the amount of stop codons (see table 1) . The 
exact distribution of nucleotides was calculated in the three 
positions of the codon to give the suggested population of 

15 amino acid changes. The doped regions were doped specifically 
in the indicated positions to have a high chance of getting the 
desired residues, but still allow other possibilities. 

Table 1: 

20 Distribution of amino acid residues for each position 

R181: 72% R, 2% N, 7% Q, 4% H, 4%K, 11%S 

G182: 73% G, 13% A, 12% S, 2% T 

K185: 95% K, 5% R 

A186: 50% A, 4% N, 6% D, 1%E, 1% G, 1% K, 5% S, 31% T 

25 W187 : 100% W 

D188: 100% D 

W189: 92% W, 8% S 

The resulting doped oligonucleotide strand is shown in 
30 table 2 as sense strand: with the wild type nucleotide and 
amino acid sequences and the distribution of nucleotides for 
each doped position. 



Table 2: 
35 Position 

Amino acid seq. 
Wt nuc. seq. 



181 
Arg 
cga 



182 
Gly 
ggt 



185 
Lys 
aaa 



186 
Ala 
get 



187 
Thr 
tgg 



188 
Asp 
gat 



189 
Thr 
tgg 



40 Forward primer (SEQ ID NO: 15) : 

FSA: 5'~caa aat cgt ate tac aaa ttc 123 456 a7g 8910 tgg 

gat tllg gaa gta gat teg gaa aat-3' 



Distribution of nucleotides for each doped Position 
45 1: 35% A, 65% C 
2: 83% G, 17% A 
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3: 63% G, 37% T 

4: 86% G, 14% A 

5: 85% G, 15% C 

6: 50% T, 50% C 

7: 95% A, 5%G 

8: 58% G, 37% A, 5% T 

9: 86% C, 13% A, 1% G 

10: 83% T, 17% G 

11: 92% G, 8% C 

Reverse primer (SEQ ID NO: 16) : 

RSA: 5'-gaa ttt gta gat acg att ttg-3' 

Random mutagenesis 

The spiked oligonucleotides apparent from Table 2 (which by 
a common term is designated FSA) and reverse primers RSA for 
the SA1 region and specific SEQ ID NO: 2: SP722 primers 
covering the SacII and the Drain sites are used to generate 
PCR-library-f ragments by the overlap extension method (Horton 
et al., Gene, 77 (1989), pp. 61-68) with an overlap of 21 base 
pairs. Plasmid pJEl is template for the Polymerase Chain 
Reaction. The PCR fragments are cloned in the E. coli/ Bacillus 
shuttle vector pDorklOl (see Materials and Methods) enabling 
mutagenesis in E. coli and immediate expression in Bacillus 
subtilis preventing lethal accumulation of amylases in E. coli. 
After establishing the cloned PCR fragments in E . coli, a 
modified pUC19 fragment is digested out of the plasmid and the 
promoter and the mutated Termamyl gene is physically connected 
and expression can take place in Bacillus . 

Screening 

The library may be screened in the low calcium filter 
assays described in the "Material and Methods" section above. 

EXAMPLE 4 

Construction of variants of amylase SEQ ID NO: 1 (SP690) 

The gene encoding the amylase from SEQ ID NO : 1 is located 
in a plasmid pTVB106 described in W096/23873. The amylase is 
expressed from the amyL promoter in this construct in Bacillus 
subtilis . 

A variant of the protein is delta (T18 3-G184 ) 
+Y243F+Q391E+K444Q. Construction of this variant is described 
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in W096/23873. 

Construction of delta (T183-G184 ) + N195F by the mega-primer 
method as described by Sarkar and Sommer, (1990), BioTechniques 
8: 404-407. 

Gene specific primer Bl (SEQ ID NO: 17) and mutagenic 
primer 101458 (SEQ ID NO: 19) were used to amplify by PCR an 
approximately 645 bp DNA fragment from a pTVB106-like plasmid 
(with the delta (T183-G184 ) mutations in the gene encoding the 
amylase from SEQ ID NO: 1) . 

The 645 bp fragment was purified from an agarose gel and 
used as a mega-primer together with primer Y2 (SEQ ID NO: 18) 
in a second PCR carried out on the same template. 

The resulting approximately 1080 bp fragment was digested 
with restriction enzymes BstEII and Afllll and the resulting 
approximately 510 bp DNA fragment was purified and ligated with 
the pTVB106-like plasmid (with the delta (T183-G184 ) mutations 
in the gene encoding the amylase from SEQ ID NO: 1) digested 
with the same enzymes. Competent Bacillus subtilis SHA27 3 
(amylase and protease low) cells were transformed with the 
ligation and Chlorampenicol resistant transf ormants and was 
checked by DNA sequencing to verify the presence of the correct 
mutations on the plasmid. 
primer Bl: (SEQ ID NO: 17) 
5' CGA TTG CTG ACG CTG TTA TTT GCG 3' 
primer Y2 : (SEQ ID NO: 18) 
5' CTT GTT CCC TTG TCA GAA CCA ATG 3' 
primer 101458 (SEQ ID NO: 19): 

5' GT CAT AGT TGC CGA AAT CTG TAT CGA CTT C 3' 

The construction of variant: delta (T183-G184 ) + K185R+A186T 
was carried out in a similar way except that mutagenic primer 
101638 was used. 

primer 101638: (SEQ ID NO: 20) 

5' CC CAG TCC CAC GTA CGT CCC CTG AAT TTA TAT ATT TTG 3' 

Variants: delta (T183-G18 4) +A18 6T, delta (T18 3-G18 4) +A18 6I, 
delta (T183-G184) +A186S, delta (T183-G184) +A186N are 

constructed by a similar method except that pTVB106-like 
plasmid (carrying variant delta (T183-G184 ) + K185R+A186T) is 
used as template and as the vector for the cloning purpose. The 
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mutagenic oligonucleotide (Oligo 1) is: 

5' CC CAG TCC CAG NTCTTT CCC CTG AAT TTA TAT ATT TTG 3' (SEQ ID 
NO: 21) 

N represents a mixture of the four bases: A, C, G, and T 
5 used in the synthesis of the mutagenicoli-gonucleotide . 

Sequencing of transf ormants identifies the correct codon for 
amino acid position 186 in the mature amylase. 

Variant: delta (T183-G184) + K185R+A186T+N195F is 

constructed as follows: 
10 PCR is carried out with primer x2 (SEQ ID NO: 22) and primer 
101458 (SEQ ID NO: 19) on pTVB106-like plasmid (with mutations 
delta (T183-G184) + K185R+A186T) . The resulting DNA fragment is 
used as a mega-primer together with primer Y2 (SEQ ID NO: 18) 
in a PCR on pTVB106-like plasmid (with mutations delta (T183- 
15 G184) + N195) . The product of the second PCR is digested with 
restriction endonucleases Acc65I and Afllll and cloned into 
pTVB106 like plasmid (delta (T183-G184 ) +N195F) digested with the 
same enzymes . 

primer x2 : (SEQ ID NO: 22) 
20 5' GCG TGG ACA AAG TTT GAT TTT CCT G 3' 

Variant: delta (T18 3-G184) + K18 5R+A18 6T+N195F+Y24 3F+ 
Q391E+K444Q is constructed as follows: 

PCR is carried out with primer x2 and primer 101458 on 
pTVB106-like plasmid (with mutations delta (T183-G184 ) + 
25 K185R+A186T) . The resulting DNA fragment is used as a mega- 
primer together with primer Y2 in a PCR on pTVB106 like plasmid 
(with mutations delta (T183-G184 ) +Y243F+Q391E+K444Q) . The 
product of the second PCR is digested with restriction 
endonucleases Acc65I and Afllll and cloned into pTVB106 like 
30 plasmid (delta (T183-G184 ) +Y243F+Q391E+K444Q) digested with the 
same enzymes . 

Example 5 

Construction of site-directed a- amylase variants in the parent 

35 SP722 g -amylase (SEQ ID NO: 2) 

Construction of variants of amylase SEQ ID NO: 2 (SP722) is 
carried out as described below. 

The gene encoding the amylase from SEQ ID NO: 2 is located 
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in a plasmid pTVB112 described in WO 96/23873, The amylase is 
expressed from the amyL promoter in this construct in Bacillus 
subtilis . 

Construction of delta (D183-G184 ) + V56I by the mega-primer 
method as described by Sarkar and Sommer, 1990 (BioTechniques 
8 : 404-407) . 

Gene specific primer DA03 and mutagenic primer DA07 are 
used to amplify by PCR an approximately 820 bp DNA fragment 
from a pTVB112-like plasmid (with the delta ( D183-G184 ) 

mutations in the gene encoding the a-amylase shown in SEQ ID 

NO: 2 . 

The 820 bp fragment is purified from an agarose gel and 
used as a mega-primer together with primer DA01 in a second PCR 
carried out on the same template. 

The resulting approximately 920 bp fragment is digested 
with restriction enzymes NgoM I and Aat II and the resulting 
approximately 170 bp DNA fragment is purified and ligated with 
the pTVB112-like plasmid (with the delta (D183-G184 ) mutations 
in the gene encoding the amylase shown in SEQ ID NO: 2) 
digested with the same enzymes. Competent Bacillus subtilis 
SHA273 (amylase and protease low) cells are transformed with 
the ligation and Chlorampenicol resistant transf ormants are 
checked by DNA sequencing to verify the presence of the correct 
mutations on the plasmid. 
primer DA01: (SEQ ID NO: 23) 
5' CCTAATGATGGGAATCACTGG 3' 
primer DA03: (SEQ ID NO:24) 
5' GCATTGGATGCTTTTGAACAACCG 3' 
primer DA07 (SEQ ID NO:25): 
5' CGCAAAATGATATCGGGTATGGAGCC 3' 

Variants: delta (D183-G184 ) + K108L, delta (D183-G184 ) + K108Q, 
delta (D183-G184) + K108E, delta (D183-G184 ) + K108V, were 
constructed by the mega-primer method as described by Sarkar 
and Sommer ,1990 (BioTechniques 8: 404-407): 

PCR is carried out with primer DA03 and mutagenesis primer 
DA20 on pTVB112-like plasmid (with mutations delta ( D183-G184 )) . 
The resulting DNA fragment is used as a mega-primer together 
with primer DA01 in a PCR on pTVB112-like plasmid (with 
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mutations delta ( D183-G184 )) . The approximately 920 bp product 
of the second PCR is digested with restriction endonucleases 
Aat II and Mlu I and cloned into pTVB112-like plasmid 
(delta (D183-G184 ) ) digested with the same enzymes, 
primer DA20 (SQ ID NO:26): 
5' G T GAT G AAC C AC S WAG G T G GAG C T GAT G C 3' 

S represents a mixture of the two bases: C and G used in 
the synthesis of the mutagenic oligonucleotide and W represents 
a mixture of the two bases: A and T used in the synthesis of 
the mutagenic oligonucleotide. 

Sequencing of transf ormants identifies the correct codon 
for amino acid position 108 in the mature amylase. 

Construction of the variants: delta ( D18 3-G184 ) + D168A, 
delta (D183-G184) + D168I, delta (D183-G184 ) + D168V, delta (D183- 
G184) + D168T is carried out in a similar way except that 
mutagenic primer DA14 is used, 
primer DA14 (SEQ ID NO:27): 
5' GATGGTGTATGGRYCAATCACGACAATTCC 3' 

R represents a mixture of the two bases: A and G used in 
the synthesis of the mutagenic oligonucleotide and Y represents 
a mixture of the two bases: C and T used in the synthesis of 
the mutagenic oligonucleotide. 

Sequencing of transf ormants identifies the correct codon 
for amino acid position 168 in the mature amylase. 

Construction of the variant: delta (D183-G184 ) + Q169N is 
carried out in a similar way except that mutagenic primer DA15 
is used. 

primer DA15 (SEQ ID NO:28): 

5' G G T G T AT G G GAT AAC T C AC G AC AAT T C C 3' 

Construction of the variant: delta (D183-G184 ) + Q169L is 
carried out in a similar way except that mutagenic primer DA16 
is used. 

primer DA16 (SEQ ID NO:29): 

5' GGTGTATGGGATCTCTCACGACAATTCC 3' 

Construction of the variant: delta (D183-G184 ) + Q172N is 
carried out in a similar way except that mutagenic primer DA17 
is used. 

primer DA17 (SEQ ID NO:30): 
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5' G G GAT C AAT C AC G AAAT T T C C AAAAT C G T AT C 3' 

Construction of the variant: delta ( D183-G184 ) + Q172L is 
carried out in a similar way except that mutagenic primer DA18 
is used. 

primer DA18 (SEQ ID N0:31): 

5' GGGATCAATCACGACTCTTCCAAAATCGTATC 3' 

Construction of the variant: delta ( D183-G184 ) + . L201I is 
carried out in a similar way except that mutagenic primer DA06 
is used. 

primer DA06 (SEQ ID NO:32): 

5' G G AAAT TAT GAT TAT AT CAT G T AT G C AG AT G TAG 3' 

Construction of the variant: delta (D183-G184 ) + K269S is 
carried out in a similar way except that mutagenic primer DA09 
is used. 

primer DA09 (SEQ ID NO: 33): 

5' GCTGAATTTTGGTCGAATGATTTAGGTGCC 3' 

Construction of the variant: delta (D183-G184 ) + K269Q is 
carried out in a similar way except that mutagenic primer DA11 
is used. 

primer DA11 (SEQ ID NO:34): 

5' GCTGAATTTTGGTCGAATGATTTAGGTGCC 3' 

Construction of the variant: delta ( D183-G184 ) + N270Y is 
carried out in a similar way except that mutagenic primer DA21 
is used. 

primer DA21 (SEQ ID NO:35): 

5' GAATTTTGGAAGTACGATTTAGGTCGG 3' 

Construction of the variants: delta ( D183-G184 ) + L272A, 
delta (D183-G184) + L272I, delta (D183-G184 ) + L272V, delta (D183- 
G184) + L272T is carried out in a similar way except that 
mutagenic primer DA12 is used, 
primer DA12 (SEQ ID NO: 36): 
5' GGAAAAACGATRYCGGTGCCTTGGAGAAC 3' 

R represents a mixture of the two bases: A and G used in the 
synthesis of the mutagenic oligonucleotide and Y represents a 
mixture of the two bases: C and T used in the synthesis of the 
mutagenic oligonucleotide . 

Sequencing of transf ormants identifies the correct codon for 
amino acid position 272 in the mature amylase. 
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Construction of the variants: delta (D183-G184 ) + L275A, 
delta (D183-G184) + L275I, delta (D183-G184 ) + L275V, delta (D183- 
G184) + L275T is carried out in a similar way except that 
mutagenic primer DA13 is used. 
5 primer DA13 (SEQ ID NO:37): 

5' GATTTAGGTGCCTRYCAGAACTATTTA 3' 

R represents a mixture of the two bases: A and G used in the 
synthesis of the mutagenic oligonucleotide and Y represents a 
mixture of the two bases: C and T used in the synthesis of the 
10 mutagenic oligonucleotide. 

Sequencing of transf ormants identifies the correct codon for 
amino acid position 275 in the mature amylase. 

Construction of the variant: delta ( D18 3-G18 4 ) + Y295E is 
carried out in a similar way except that mutagenic primer DA08 

15 is used. 

primer DA08 (SEQ ID NO:38): 

5' CCCCCTTCATGAGAATCTTTATAACG 3' 

Construction of delta (D183-G184 ) + K446Q by the mega-primer 

method as described by Sarkar and Sommer,1990 (BioTechniques 8: 
20 4 0 4 -407) : 

Gene specific primer DA04, annealing 214-231 bp downstream 
relative to the STOP-codon and mutagenic primer DA10 were used 
to amplify by PCR an approximately 350 bp DNA fragment from a 
pTVB112-like plasmid (with the delta (D183-G184 ) mutations in 

25 the gene encoding the amylase depicted in SEQ ID NO: 2) . 

The resulting DNA fragment is used as a mega-primer 
together with primer DA05 in a PCR on pTVB112 like plasmid 
(with mutations delta ( D18 3-G184 )) . The app . 460 bp product of 
the second PCR is digested with restriction endonucleases SnaB 

30 I and Not I and cloned into pTVB112 like plasmid (delta (D183- 
G184)) digested with the same enzymes, 
primer DA04 (SEQ ID NO: 39): 
5' GAAT C C G AAC C T C AT T AC AC AT T C G 3' 
primer DA05 (SEQ ID NO:40): 

35 5' C G GAT G G AC T C G AG AAG G AAAT AC C AC G 3' 
primer DA10 (SEQ ID NO: 41): 
5' CGTAGGGCAAAATCAGGCCGGTCAAGTTTGG 3' 

Construction of the variants: delta (D183-G184 ) + K458R is 
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carried out in a similar way except that mutagenic primer DA22 
is used. 

primer DA22 (SEQ ID NO:42): 

5 ' CATAACTGGAAATCGCCCGGGAACAGTTACG 3' 

Construction of the variants: delta (D183-G184 ) + P459S and 
delta (D183-G184 ) + P459T is carried out in a similar way except 
that mutagenic primer DA19 is used, 
primer DA19 (SEQ ID NO:43): 
5' CTGGAAATAAAWCCGGAACAGTTACG 3' 

W represents a mixture of the two bases: A and T used in the 
synthesis of the mutagenic oligonucleotide. 

Sequencing of transf ormants identifies the correct codon for 
amino acid position 459 in the mature amylase. 

Construction of the variants: delta ( D183-G184 ) + T461P is 
carried out in a similar way except that mutagenic primer DA23 
is used. 

primer DA23 (SEQ ID NO:44): 

5' G G AAAT AAA C C AG G AC C C G T T AC GAT C AAT G C 3' 

Construction of the variant: delta (D183-G184 ) + K142R is 
carried out in a similar way except that mutagenic primer DA32 
is used. 

Primer DA32 (SEQ ID NO: 45) : 

5' GAGGCTTGGACTAGGTTTGATTTTCCAG 3' 

Construction of the variant: delta (D183-G184 ) + K269R is 
carried out in a similar way except that mutagenic primer DA31 
is used. 

Primer DA31 (SEQ ID NO: 46) : 

5' GCTGAATTTTGGCGCAATGATTTAGGTGCC 3' 



Example 6 

Construction of site-directed a-amylase variants in the parent 
Termamyl a -amylase (SEQ ID NO: 4) 

The amyL gene, encoding the Termamyl a-amylase is located 
in plasmid pDN1528 described in WO 95/10603 (Novo Nordisk) . 
Variants with substitutions N265R and N265D, respectively, of 

said parent a-amylase are constructed by methods described in 
WO 97/41213 or by the "megaprimer" approach described above. 
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Mutagenic oligonucleotides are: 
Primer bll for the N265R substitution: 

5' PCC AGC GCG CCT AGG TCA CGC TGC CAA TAT TCA G (SEQ ID NO: 
56) 

Primer bl2 for the N265D substitution: 

5' PCC AGC GCG CCT AGG TCA TCC TGC CAA TAT TCA G (SEQ ID NO: 
57) 

P represents a phosphate group. 
Example 7 

Determination of pH stability at alkaline pH of variants of the 
parent a -Amylase having the amino acid sequence shown in SEQ ID 
NO: 2. 

In this serie of analysis purified enzyme samples were used. 
The measurements were made using solutions of the respective 
variants in 100 mM CAPS buffer adjusted to pH 10.5. The 

solutions were incubated at 75°C. 

After incubation for 20 and 30 min the residual activity was 
measured using the PNP-G7 assay (described in the "Materials 
and Methods" section above) . The residual activity in the 
samples was measured using Britton Robinson buffer pH 7.3. The 
decline in residual activity was measured relative to a 
corresponding reference solution of the same enzyme at 0 

minutes, which has not been incubated at high pH and 75°C. 



The percentage of the initial activity as a function is shown 
in the table below for the parent enzyme (SEQ ID NO: 2) and for 
the variants in question. 



variant 


Residual activity 
after 20 min 


Residual activity 
after 30 min 


A(D183-G184) +M323L 


bb % 


44 % 


A(D18 3-G18 4) +M323L+R181S 


bl % 


bb % 


A(D183-G184) +M323L+A186T 


b2 % 


bU % 



In an other series of analysis culture supernatants were 
used. The measurements were made using solutions of the 
respective variants in 100 mM CAPS buffer adjusted to pH 10.5. 
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The solutions were incubated at 80°C. 

After incubation for 30 minutes the residual activity was 
measured using the Phadebas assay (described in the "Materials 
and Method" secion above. The residual activity in the samples 
5 was measured using Britton Robinson buffer pH 7.3. The decline 
in residual activity was measured relative to a corresponding 
reference solution of the same enzyme at 0 minutes, which has 

not been incubated at high pH and 80°C. 

The percentage of the initial activity as a function is shown 
10 in the table below for the parent enzyme (SEQ ID NO: 2) and for 
the variants in question. 



Variant 


Residual actxvxty atter mm 


A(D183-G184) 


4 % 


A (D183-G184 ) +P459T 


2b % 


A(D18 3-G18 4) +K4 58R 


31 % 


A(D183-G184)+K311R 


1U % 



Example 8 

Determination of calcium stability at alkaline pH of variants 
15 of the parent g-Amylase having the amino acid sequence shown in 
SEQ ID NO: 1, SEQ ID NO: 2 and SEQ ID NO: 4. 

A: Calcium stability of variants of the sequence in SEQ ID NO:l 
The measurement were made using solutions of the respective 
20 variants in 100 mM CAPS buffer adjusted to pH 10.5 to which 
polyphosphate was added (at time t = 0) to give a final 

concentration of 2400 ppm. The solutions were incubated at 50°C. 

After incubation for 20 and 30 minutes the residual activity 
was measured using the PNP-G7 assay (described above) . The 
25 residual activity in the samples was measured using Britton 
Robinson buffer pH 7.3. The decline in residual activity was 
measured relative to a corresponding reference solution of the 
same enzyme at 0 minutes, which has not been incubated at high 

pH and 50°C. 

30 The percentage of the initial activity as a function is shown 

in the table below for the parent enzyme (SEQ ID NO: 1) and for 
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the variants in question. 



Variant 


Residual activity 
after 20 min 


Residual activity 
after 30 min 


A(T183-G184) 




ly % 


A (T183-G184 ) + A186T 


Jb % 


23 % 


A(T183-G184) +K185R+A186T 


4b % 


ky % 


A (T18 3-G18 4 ) +A18 6I 






A(T183-G184) +N195F 


44 % 


n . d . 



n.a.= Not aetermmatea 



B: Calcium stability of variants of the sequence in SEQ ID NO: 2 
5 In this series of analysis purified samples of enzymes were 

used. The measurement were made using solutions of the 
respective variants in 100 mM CAPS buffer adjusted to pH 10.5 
to which polyphosphate was added (at time t = 0) to give a final 

concentration of 2400 ppm. The solutions were incubated at 50°C. 

10 After incubation for 20 and 30 minutes the residual activity 

was measured using the PNP-G7 assay (described above) . The 
residual activity in the samples was measured using Britton 
Robinson buffer pH 7.3. The decline in residual activity was 
measured relative to a corresponding reference solution of the 

15 same enzyme at 0 minutes, which has not been incubated at high 

pH and 50°C. 

The percentage of the initial activity as a function is shown 
in the table below for the parent enzyme (SEQ ID NO: 2) and for 
the variants in question. 



20 



Variant 


Residual activity 
after 20 min 


Residual activity 
after 30 min 


A(D183-G184) +M323L 


21 % 


13 % 


A(D18 3-G18 4) +M32 3L+R18 IS 


Sz % 


19 % 


A(D183-G184) +M32 3L+A18 6T 


2 O *6 


1 / % 


A(D183-G184) +M323L+A18 6R 


3U % 


18 % 




Variant 


Residual activity 
after 20 min 


Residual activity 
after 30 min 
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A(D183-G184) 






A(D183-G184) +N195F 




44% 



In this serie of analysis culture supernatants were used. The 
measurement were made using solutions of the respective 
variants in 100 mM CAPS buffer adjusted to pH 10.5 to which 
polyphosphate was added (at time t=0) to give a final 

concentration of 2400 ppm. The solutions were incubated at 50°C. 

After incubation for 30 minutes the residual activity was 
measured using the Phadebas assay as described above. The 
residual activity in the samples was measured using Britton 
Robinson buffer pH 7.3. The decline in residual activity was 
measured relative to a corresponding reference solution of the 
same enzyme at 0 minutes, which has not been incubated at high 

pH and 50°C. 

The percentage of the initial activity as a function is shown 
in the table below for the parent enzyme (SEQ ID NO: 2) and for 
the variants in question. 



variant 


Residual activity atter JU min 


A(D183-G184) 


U % 


A (D183-G184 ) +P459T 


19 % 


A (D183-G184 ) 4-K458R 


lb % 


A(D183-G184)+T461P 


13 % 


A (D18 3-G18 4) +E34 6Q+K38 5R 


4 % 



C: Calcium stability of variants of the sequence in SEQ ID NO: 4 

The measurement were made using solutions of the respective 
variants in 100 mM CAPS buffer adjusted to pH 10.5 to which 
polyphosphate was added (at time t = 0) to give a final 

concentration of 2400 ppm. The solutions were incubated at 60°C 
for 20 minutes. 

After incubation for 20 minutes the residual activity was 
measured using the PNP-G7 assay (described above) . The residual 
activity in the samples was measured using Britton Robinson 
buffer pH 7.3. The decline in residual activity was measured 
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relative to a corresponding reference solution of the same 
enzyme at 0 minutes, which has not been incubated at high pH 

and 60°C. 

The percentage of the initial activity as a function is shown 

5 in the table below for the parent enzyme (SEQ ID NO: 4) and for 
the variants in question. 



variant 


Residual 
activity after 
20 min 


Termamyl (SEQ lb nu: 4) 


1 / % 


N2bbK 


28 % 


N2bbU 


2b % 



Example 9 : 

10 Activity measurement at medium temperature of a- Amylases having 
the amino acid sequence shown in SEQ ID NO: 1. 

A: a-Amylase activity of variants of the sequence in SEQ ID 
NO: 1 

The measurement were made using solutions of the respective 
15 variants in 50 mM Britton Robinson buffer adjusted to pH 7.3 
and using the Phadebas assay described above. The activity in 

the samples was measured at 37°C using 50 mM Britton Robinson 

buffer pH 7.3 and at 25°C using 50 mM CAPS buffer pH 10.5. 

The temperature dependent activity and the percentage of the 

20 activity at 25°C relative to the activity at 37°C are shown in 
the table below for the parent enzyme (SEQ ID NO: 1) and for 
the variants in question. 



Variant 


NU/mg 25°C 


NU/mg 37°C 


NU (25°C) 
/ NU(37°C) 




1440 




4 . i % 


A(T183-G184) 


2yuu 


4UUUU 


/ . 3 % 


A(T183-G184) +K269S 


i860 


12UUU 


ib.b % 


A(Q174) 


383U 




/ . 9 % 



Another measurement was made using solutions of the 
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respective variants in 50 mM Britton Robinson buffer adjusted 
to pH 7.3 and using the Phadebas assay described above. The 

activity in the samples was measured at 37°C and 50°C using 50 
mM Britton Robinson buffer pH 7.3. 
5 The temperature dependent activity and the percentage of the 

activity at 37°C relative to the activity at 50°C is shown in 
the table below for the parent enzyme (SEQ ID NO: 1) and for 
the variants in question. 



Variant 


NU/mg 37°C 


NU/mg 50°C 


NU(37°C) / 
NU (50°C) 


(seq ID NO: 1) 


13U9U 




bU % 


K2b9Q 


/bU4 


lUUbJ 


/b % 



10 

B : g-Amylase activity of variants of the sequence in SEQ ID 
NO: 2 

The measurement were made using solutions of the respective 
variants in 50 mM Britton Robinson buffer adjusted to pH 7.3 
15 and using the Phadebas assay described above. The activity in 

the samples was measured at both 25°C and 37°C using 50 mM 
Britton Robinson buffer pH 7.3. 

The temperature dependent activity and the percentage of the 

activity at 25°C relative to the activity at 37°C is shown in 
20 the table below for the parent enzyme (SEQ ID NO: 2) and for 
the variants in question. 



Variant 


NU/mg 
2 5°C 


NU/mg 
37°C 


NU(25°C) / 
NU (37°C) 


A(D183-G184) +M323L 


3U49 




3U % 


A(D183-G18 4) +M323L+R181S 


lBbyb 


3643b 


bl % 



C : g -Amylase activity of variants of the sequence in SEQ ID 
NO: 4 

25 The measurement were made using solutions of the respective 

variants in 50 mM Britton Robinson buffer adjusted to pH 7.3 
and using the Phadebas assay described above. The activity in 
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the samples was measured at both 37°C using 50 mM Britton 

Robinson buffer pH 7.3 and at 60°C using 50 mM CAPS buffer pH 
10.5. 

The temperature dependent activity and the percentage of the 

5 activity at 37°C relative to the activity at 60°C is shown in 
the table below for the parent enzyme (SEQ ID NO: 4) and for 
the variants in question. 



variant 


NU/mg 37°C 


NU/mg 60°C 


NU(37°C) / NU(60°C) 


Termamyi 


/4UU 


4 JbU 


1 /U % 




1UUUU 


4bbU 


21b % 
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Example 10 

Construction of variants of parent hybrid BAN : 1 - 
300/Termamyl : 301-483 a-amylase 

Plasmid pTVB191 contains the gene encoding hybrid a-amylase 
BAN : 1-300/Termamyl : 301-483 as well as an origin of replication 
functional in Bacillus subtilis and the cat gene conferring 
chloramphenicol resistance . 

Variant BM4 (F290E) was constructed using the megaprimer 
approach (Sarkar and Sommer, 1990) with plasmid pTVB191 as 
template . 

Primer pi (SEQ ID NO: 52) and mutagenic oligonucleotide bm4 
(SEQ ID NO: 47) were used to amplify a 444 bp fragment with 
polymerase chain reaction (PCR) under standard conditions. 
This fragment was purified from an agarose gel and used as 
'Megaprimer' in a second PCR with primer p2 (SEQ ID NO: 53) 
resulting in a 531 bp fragment. This fragment was digested with 
restriction endonucleases HinDIII and Tthllll. The 389 bp 
fragment produced by this was ligated into plasmid pTVB191 that 
had been cleaved with the same two enzymes. The resulting 
plasmid was transformed into B. subtilis SHA27 3. Chloramphenicol 
resistant clones were selected by growing the transf ormants on 
plates containing chloramphenicol as well as insoluble starch. 

Clones expressing an active a-amylase were isolated by selecting 
clones that formed halos after staining the plates with iodine 
vapour. The identity of the introduced mutations was confirmed 
by DNA sequencing. 

Variants BM5(F290K), BM6(F290A), BM8 (Q360E) and BM11 (N102D) 
were constructed in a similar way. Details of their construction 
are given below. 

Variant : BM5 (F290K) 

mutagenic oligonucleotide: bm5 (SEQ ID NO: 48) 
Primer (1st PCR): pi (SEQ ID NO: 52) 
Size of resulting fragment: 444 bp 
Primer (2nd PCR): p2 (SEQ ID NO: 53) 
Restriction endonucleases: HinDIII, Tthllll 
Size of cleaved fragment: 389 bp 
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Variant: BM6 (F290A) 

mutagenic oligonucleotide: bm6 (SEQ ID NO: 49) 
Primer (1st PCR) : pi (SEQ ID NO: 52) 
Size of resulting fragment: 444 bp 
Primer (2nd PCR) : p2 (SEQ ID NO: 53) 
Restriction endonucleases : HinDIII, Tthllll 
Size of cleaved fragment: 389 bp 

Variant: BM8 (Q360E) 

mutagenic oligonucleotide: bm8 (SEQ ID NO: 50) 
Primer (1st PCR) : pi (SEQ ID NO: 52) 
Size of resulting fragment: 230 bp 
Primer (2nd PCR) : p2 (SEQ ID NO: 53) 
Restriction endonucleases: HinDIII, Tthllll 
Size of cleaved fragment: 389 bp 

Variant: BM11 (N102D) 

mutagenic oligonucleotide: bmll (SEQ ID NO: 51) 
Primer (1st PCR): p3 (SEQ ID NO: 54) 
Size of resulting fragment: 577 
Primer (2nd PCR) : p4 (SEQ ID NO: 55) 
Restriction endonucleases: HinDIII, Pvul 
Size of cleaved fragment: 576 



Mutagenic oligonucleotides : 
bm4 (SEQ ID NO: 47) : F290E 

primer 5 ! GTG TTT GAC GTC CCG CTT CAT GAG AAT TTA CAG G 
bm5 (SEQ ID NO: 48) : F290K 

primer 5 1 GTG TTT GAC GTC CCG CTT CAT AAG AAT TTA CAG G 
bm6 (SEQ ID NO: 49) : F290A 

primer 5 ? GTG TTT GAC GTC CCG CTT CAT GCC AAT TTA CAG G 
bm8 (SEQ ID NO: 50) : Q360E 

primer 5* AGG GAA TCC GGA TAC CCT GAG GTT TTC TAC GG 
bmll (SEQ ID NO: 51) : N102D 

primer 5 ! GAT GTG GTT TTG GAT CAT AAG GCC GGC GCT GAT G 



Other primers: 
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pi: 5' CTG TTA TTA ATG CCG CCA AAC C (SEQ ID NO: 52) 
p2: 5' G GAA AAG AAA TGT TTA CGG TTG CG (SEQ ID NO: 53) 
p3: 5' G AAA TGA AGC GGA ACA TCA AAC ACG (SEQ ID NO: 54) 
p4: 5 ! GTA TGA TTT AGG AGA ATT CC (SEQ ID NO: 55) 

Example 11 

a -Amylase activity at alkaline pH of variants of parent BAN:1- 

3 00/Termamyl : 301-483 hybrid a -amylase. 

The measurements were made using solutions for the 
respective enzymes and utilizing the Phadebas assay (described 
above) . The activity was measured after incubating for 15 

minutes at 30°C in 50 mM Britton-Robinson buffer adjusted to the 
indicated pH by NaOH. 

NU/mg enzyme 

pH wt Q360E F290A F290K F290E N102D 

8.0 5300 7800 8300 4200 6600 6200 

9.0 1600 2700 3400 2100 1900 1900 
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CLAIMS 

1. A variant of a parent Termamyl-like oc-amylase, which variant 

has a-amylase activity, said variant comprises one or more 
mutations corresponding to the following mutations in the amino 
acid seguence shown in SEQ ID NO: 2: 

T141, K142, F143, D144, F145, P146, G147, R148, G149, 
Q174, R181, G182, D183, G184, K185, A186, W189, S193, N195 
H107, K108, G109,D166, W167, D168, Q169, S170, R171, Q172, F173, 
F267, W268, K269, N270, D271, L272, G273, A274, L275, K311, 
E346, K385, G456, N457, K458,P459, G460, T461, V462, T463. 

2. The variant according to claim 1, which variant has one or 
more of the following substitutions or deletions: 

T141A, D,R,N,C,E,Q,G,H, I,L,K,M, F, P,S,W, Y,V; 
K142A, D,R,N,C,E,Q,G,H,I,L,M,F,P,S,T,W,Y,V; 
F143A / D,R,N,C,E,Q,G,H,I,L,K,M,P,S,T,W,Y / V; 
D14 4A,R,N,C,E,Q,G,H, I , L, K, M, F, P, S, T, W, Y, V; 
F145A, D,R,N,C,E,Q,G,H,I,L,K,M,P,S,T,W,Y,V; 
P14 6A,D,R,N,C,E,Q,G,H, I , L, K, M, F, S, T, W, Y, V; 
G147A,D,R,N,C,E / Q,H, I , L, K, M, F, P, S , T, W, Y, V; 
R14 8A, D,N,C,E,Q,G,H,I,L,K,M,F,P,S,T,W,Y,V; 
G14 9A, D,R,N,C,E,Q,H,I,L,K,M,F,P,S,T,W,Y,V; 
R181*,A, D,N,C,E,Q,G,H, I,L,K,M,F, P, S,T,W, Y,V; 
G182*,A, D,R,N,C,E,Q,H,I,L,K,M,F,P,S,T,W,Y,V; 
D183*,A,R,N,C,E,Q,G,H, I , L, K, M, F, P, S , T , W, Y , V; 
G184*,A,R,D,N,C,E,Q,H,I,L,K,M,F,P,S,T,W,Y,V; 
K185A, D,R,N,C,E,Q,G,H,I,L,M,F, P,S,T,W,Y,V; 
A18 6D, R,N,C,E,Q,G,H, I,L,K,M, F, P, S,T,W,Y,V; 
W18 9A, D,R,N, C,E,Q, G, H, I, L, K,M, F, P, S,T, Y, V; 
S193A,D,R,N,C,E,Q,G,H, I,L,K,M, F, P , T , W , Y , V ; 
N195A,D,R,C,E,Q,G,H, I,L,K,M,F, P,S,T,W, Y,V; 
H107A, D,R,N,C,E,Q,G, I,L,K,M, F, P,S,T,W, Y, V; 
K108A, D,R,N,C,E,Q,G,H,I,L,M,F,P,S,T,W,Y,V; 
G10 9A, D,R,N,C,E,Q,H, I, L, K,M, F, P,S,T,W,Y,V; 
Dl 66A, R,N,C,E,Q,G,H,I,L,K,M,F,P,S,T,W,Y,V; 
Wl 67A, D,R,N,C,E,Q,G,H,I,L,K,M,F,P,S,T,Y,V; 
D168A,R f N,C,E,Q,G,H, I,L,K,M, F, P , S , T , W, Y, V; 



WO 99/23211 PCT/DK98/00471 - 

61 

Q169A,D,R,N,C,E,G,H, I,L,K f M, F, P, S , T, W, Y, V; 
S170A, D,R,N,C,E,Q,G,H,I,L,K,M,F,P,T,W,Y,V; 
R171A, D,N,C,E,Q,G,H, I,L,K,M, F, P,S,T,W, Y,V; 
Q172A, D,R,N, C,E,G,H, I,L,K,M, F,P,S,T,W, Y,V; 
5 F17 3A, D,R,N,C,E,Q,G,H,I,L,K,M,P,S,T,W,Y,V; 
Q17 4*,A, D,R,N,C,E,G,H,I,L,K,M,F,P,S,T,W,Y,V; 

F267A, D,R,N,C,E,Q,G,H,I,L,K,M,P,S,T,W,Y,V; 
W2 68A, D,R,N,C,E,Q,G,H, I , L , K , M , F, P , S , T , Y , V ; 

K2 6 9 A ,D,R,N,C,E,Q,G,H,I,L,M,F,P,S,T,W,Y,V; 
10 N27 0A,D,R,C,E,Q,G,H,I,L,K,M,F,P,S,T,W,Y,V; 

D271A,R,N,C,E,Q,G,H,I,L,K,M,F,P,S,T,W, Y,V; 

L272A, D,R,N,C,E,Q,G,H, I,K,M, F, P, S,T,W,Y,V; 

G273A,D,R,N,C,E,Q,H,I,L,K,M, F, P, S , T , W, Y , V; 

A27 4D,R,N,C,E,Q,G,H, I,L,K,M, F, P, S , T , W, Y , V; 
15 L275A,D,R,N,C,E,Q,G,H, I,K,M, F, P, S, T, W, Y, V; 

K311A,D,R,N,C,E,Q,G,H, I,L,M,F, P,S,T,W,Y,V; 

E34 6A, D,R,N,C,Q, G,H, I, K,L,M, F, P, S, T, W, Y, V; 

K385A, D,R,N,C,E,Q,G,H,I,L,M,F,P,S,T,W,Y,V; 

G456A, D,R,N,C,E,Q,H, I,L,K,M,F,P,S,T,W, Y,V; 
20 N4 57A, D,R,C,E,Q,G,H, I,L,K,M, F, P, S , T , W, Y , V; 

K4 58A, D,R,N,C,E,Q,G,H,I,L,M,F,P,S,T,W,Y,V; 

P459A, D,R,N,C,E,Q,G,H,I,L,K,M,F,S,T,W,Y,V; 

G4 60A, D,R,N,C,E,Q,H,I,L,K,M,F,P,S,T,W,Y,V; 

T4 61A, D,R,N,C,E,Q,G,H, I,L,K,M, F, P, S,W,Y,V; 
25 V4 62A, D,R,N,C,E,Q,G,H,I,L,K,M,F,P,S,T,W,Y; 

T4 63A, D,R,N,C,E,Q,G,H, I,L,K,M, F, P, S,W, Y, V. 

3. The variant according to claim 2, wherein the variant has one 
or more of the following substitutions or deletions: 
30 K142R; S193P; N195F; K269R,Q; N270Y,R f D; K311R; E346Q; K385R; 
K458R; P459T; T461P; Q174*; R181Q,N,S; G182T,S,N; D183*; G184*; 
K18 5A,R, D,C,E,Q,G,H,I,L,M,N,F,P,S,T,W,Y,V; Al 8 6T , S , N , I , V , R ; 

W189T, S,N,Q. 

35 4. The variant according to claims 1-3, wherein the variant has 
a deletion in position D183 + G184, and further one or more of 
the following substitutions or deletions: K142R; S193P; N195F; 
K269R,Q; N270Y,R,D; K311R; E346Q; K385R; K458R; P459T; T461P; 
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Q174*; R181Q,N,S; G182T,S,N; D183*; G184*; 

K18 5A,R, D,C,E,Q,G,H, 1 , L , M , N , F , P , S,T,W,Y,V; A18 6T , S , N , I , V, R; 
W189T, S, N, Q. 

5. The variant according to any of claims 1-4, wherein the 
variants exhibits an alteration in at least one of the following 

properties relative to the parent a-amylase: 

i) improved pH stability at a pH from 8 to 10.5; and/or 

ii) improved Ca 2+ stability at pH 8 to 10.5, and/or 

iii) increased specific activity at temperatures from 10 to 60°C, 
preferably 20-50°C, especially 30-40°C. 

6. The variant according to any of claims 1-5, exhibiting 
improved stability at pH 8 to 10.5, having mutations in one or 
more of the position (s) corresponding to the following positions 

(using SEQ ID NO: 2 numbering) : T141, K142, F143, D144, F145, 
P146, G147, R148, G149, R181, A186, S193, N195, K269, N270, 
K311, K458, P459, T461. 

7. The variant according to claim 6, which variant has one or 
more of the following substitutions: 



T141A, 


D, 


R, 


N, 


c, 


E, 


Q, 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


w, 


Y, 


V; 


K142A, 


D, 


R, 


N , 


c, 


E, 


Q, 


G, 


H, 


I, 


L, 


M, 


F, 


P, 


s, 


T, 


w, 


Y, 


V; 


F143A, 


D, 


R, 


N, 


c, 


E, 


Q, 


G, 


H, 


I, 


L, 


K, 


M, 


P, 


s, 


T, 


w, 


Y, 


V; 


D144A, 


R, 


N , 


c, 


E, 


Q, 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T , 


w, 


Y, 


V; 


F145A, 


D, 


R, 


N , 


c, 


E, 


Q, 


G, 


H, 


I, 


L, 


K, 


M, 


P, 


s, 


T , 


w, 


V 


V; 


P146A, 


D, 


R, 


N, 


c, 


E, 


Q, 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


s, 


T , 


w, 


Y, 


V; 


G147A, 


D, 


R, 


N , 


c, 


E, 


Q, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T, 


w, 


Y, 


V; 


R148A, 


D, 


N, 


c, 


E, 


Q, 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s # 


T , 


J, 


Y, 


V; 


G149A, 


D, 


R, 


N, 


- c, 


E, 


Q/ 


H, 


I, 


L, 


- K, 


M, 


F, 


P, 


s, 


T , 


. w, 


Y, 


- V; 


K181A, 


D, 


R, 


N, 


,C, 


E, 


-Q, 


. G, 


•H, 


I, 


r L, 


M, 


F, 


P, 


s, 


r Tj 


, w, 


• Y, 


r V; 


A186D, 


R, 


•N, 


,C, 


r E, 


• Q, 


, G, 


. H, 


- 1/ 


- 1/ 


> Pi 


, K, 


,M, 


F, 


- s, 


, T j 


r W, 


, Y , 


r V; 


S193A, 


D, 


, R, 


. N, 


' c, 


, E, 




, G, 


, H, 


- 1/ 


, L, 


, K, 


, M, 


F, 


' Pi 


, T j 


r W ( 


, Y t 


, V; 


N195A, 


- D, 


> Ri 


,C, 


r E, 


> Qi 


r G, 


> H, 


- I* 


r L , 




,M, 


' F, 


P, 


F S, 


J, 


J ; 


■ Y, 


f V; 


K269A, 


- D, 


> R> 


r N j 


r C, 


r E ; 


rQ> 


, G, 




r I , 


, L, 




* F, 


P, 


rS, 


, T 


r w ( 


, Y 


r V; 


N270A, 


> Di 






r Ej 




r G i 






, L, 




f M j 


> F, 


- P. 




r T 




r Y 


r V, 


K311A, 


> D, 




r N 


r C, 


, E 


rQ, 


r 


r H ; 


, I 






r Fj 


> Pi 




r T 




, Y 
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K4 5 8A, D, R,N, C,E,Q,G,H, I f L,M, F, P, S,T,W, Y, V; 
P459A, D,R,N,C,E,Q,G,H,I,L,K,M,F,S,T,W,Y,V; 
T4 61A, D f R,N, C, E, Q, G, H, I, L, K, M, F, P, S, W, Y, V. 

5 8. The variant according to claim 7, wherein the variant has one 
or more of the following substitutions: K142R, R181S, A186T, 
S193P, N195F, K269R, N270Y, K311R, K458R, P459T and T461P. 

9. The variant according to claims 1-5 , exhibiting improved Ca 2+ 
10 stability at pH 8 to 10.5, having mutations in one or more of 
the following positions (using the SEQ ID NO: 2 numbering) : 
R181, G182, D183, G184, K185, A186, W189, N195, N270, E346, 
K385, K458, P459. 

15 10. The variant according to claim 9, which variant has one or 
more of the following substitutions or deletions: 



R181*, 


A, 


D r 


N, 


c, 


E, 


Qf 


G,H, 


I, 


L, 


K, 


M, 


F, 


P, 


s. 


T, 


W, Yf 


V 


G182*, 


A, 


Df 


Rf 


N, 


c, 


E, 


Qf H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T, 


W f Y, 


V 


D183*, 


A, 


R, 


N, 


c, 


E, 


Qf 


G, H, 


I, 


L, 


K, 


M f 


F, 


P, 


s, 


T , 


W, Y, 


V 


G184*, 


A, 


Rf 


D, 


N, 


c, 


E/ 


Qf H, 


I, 


L, 


K, 


Mf 


F, 


P, 


s, 


T, 


W, Y, 


V 


K185A, 


D, 


R, 


N, 


c, 


E, 


Qf 


G, H, 


I, 


L, 


M, 


Ff 


P, 


s, 


T, 


W, 


Y,V; 




A186D, 


R, 


N, 


c, 


E, 


Qf 


G, 


H, I, 


L, 


K, 


M, 


F, 


P, 


s, 


T, 


. w # 


Y,V; 




W189A, 


D, 


R, 


N, 


.c, 


E, 


-Qf 


G,H, 


I, 


-L, 


K, 


Mf 


F, 


p, 


s, 


, T, 


- Y,V; 




N195A, 


D, 


R/ 


c, 


- E, 


Q, 


-G, 


H,I, 


L, 


, K, 


M, 


F f 


P, 


s, 


T, 


, W, 


. Y,V; 




N270A, 


• R, 


D, 


N, 


' c, 


E, 




H f I, 


L, 


■ K, 


M, 


Ff 


P, 


s, 


- T # 


, w, 


- Y,V; 




E346A, 


, R, 


D, 


- N, 


,C, 


- Q, 


- G # 


,H, I, 


L, 


, K, 


, M # 


> F f 


P, 




, T i 


, w, 


> Y,V; 




K385A, 


, R, 


D, 


,N ; 


> c, 


. E # 




> G, H, 


- 1/ 


r L, 


, M, 


> F f 


R f 


s, 


. T , 


, w, 


'YfV; 




K458A, 


, R, 


, D, 


J, 




r E| 


>Q, 


,G f H, 




,L, 


,M, 


, F. 


P, 


• s, 


, T, 




,Y f V; 




P459A, 


. R, 


' D, 




r C, 




'Q, 


, G, H, 


' Ij 


, L, 


> K, 


,M, 


F, 


s, 




r W, 


, Yf V. 





30 

11. The variant according to claim 10, wherein the variant has 
one or more of the following substitutions or deletions: 
R181Q,N; G182T,S,N; D183*; G184*; 

K185A, R, D f C,E,Q f G f H f I / L,M / N / F, P, S,T,W,Y,V; A18 6T , S , N, I , V; 
35 W189T,S,N,Q; N195F; N270R,D; E346Q; K385R; K458R; P459T. 



12. A variant according to claims 1-11, wherein the parent 
Termamyl-like a-amylase is selected from: 
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the Bacillus strain NCIB 12512 a-amylase having the sequence 
shown in SEQ ID NO: 1; 

the B. amyloliquefaciens a-amylase having the sequence shown in 
SEQ ID NO: 5; 

5 the B. licheniformis a-amylase having the sequence shown in SEQ 
ID NO: 4. 

13. The variant according to claims 1-5, exhibiting increased 
specific activity at a temperatures from 10 to 60°C, preferably 

10 20-50°C, especially 30-40°C, having mutation (s) in one or more of 
the following positions (using the SEQ ID NO: 2 numbering): 
H107, K108, G109, D166, W167, D168, Q169, S170, R171, Q172, 
F173, Q174, D183, G184, N195, F267, W268, K269,N270, D271, L272, 
G273, A274, L275, G456, N457, K458, P459, G460, T461, V462, 

15 T463. 



20 



25 



30 



35 



14. The 
more of 



variant according to claim 13, which variant has one or 
the following substitutions: 



H107A, 


D, 


R, 


N, 


c, 


E, 


Q, 


G, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T, 


W, 


Y, 




K108A, 


D, 


R, 


N, 


c, 


E, 


Q, 


G, 


H, 


I, 


L, 


M, 


F, 


P, 


s, 


T, 


W, 


Y, 


v ( 


G109A, 


D, 


R, 


N, 


c, 


E, 


Qr 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T , 


w, 


Y, 


V 


D166A, 


R, 


N , 


c, 


E, 


Qf 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T, 


w, 


Y, 


V 


W167A, 


D, 


R, 


N, 


c, 


E, 


Q, 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


p, 


s, 


T , 


Y, 


V 


D168A, 


R, 


N, 


c, 


E, 


Qf 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T , 


W, 


Y, 


V 


Q169A, 


D, 


R, 


N, 


c, 


E, 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T, 


w, 


Y, 


V 


S170A, 


D, 


R, 


N, 


c, 


E, 


Q, 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


p, 


T, 


w, 


Y, 


V 


R171A, 


D, 


N, 


c, 


E, 


Q, 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T, 


w, 


Y, 


- V 


Q172A, 


D, 


R, 


N, 


c, 


E, 


G, 


H, 


I, 


- L, 


- K, 


M, 


F, 


P, 


s, 


- T, 


, w, 


Y, 


, V 


F173A, 


D, 


R, 


. N, 


c, 


- E, 


Q, 


. G, 


H, 


' 1/ 


, L, 


- K, 


M, 


P, 


s, 


- T , 


. w # 


Y, 


, V 


Q174*, 


A, 


- D, 


- R, 


•N, 


• c, 


- E, 


- G, 


• H, 


' 1/ 


r L, 


r K, 


M, 


F, 


p, 


>s, 


r T, 


, W, 


, Y 


D183*, 


A; 


, D, 


, R, 


, N, 


, C, 


f E, 


rQ, 


. G, 


r H, 


r I ( 


r L, 


- K, 


M, 


' F, 


r Pi 


r S, 


, W, 


, Y 


G184*, 


A, 


> Ri 


r N j 


, C, 


, E, 




- G/ 


, H, 


> I| 


, L, 


r K, 


,M, 


F, 


- Pi 


r S ( 


r T , 




, Y 


N195A, 


, D, 


> Ri 


> c, 


, E, 


>Q, 


f G, 


r H, 


r I. 


r L, 


r K, 


rM, 


r F, 




, s, 


r T ( 




r Y ( 


r V 


F267A, 


> D, 


f R. 


r N t 


r C, 






r G, 


r H, 


r I, 


r L, 


r K ( 


rM, 


- P, 




r T 




r Y 


r V 


W268A, 




r R, 


f N, 


r C, 




rQ, 


rG t 


r H 


, I 


, L 




, M, 


' F, 


, P 


r S 


, T 


, Y 


r V 


K269A, 


r D, 


r R 


r N 


f c 


r E 


rQ, 


f G, 


r H 


r I 


r L 




r F, 


> P ( 


, s 


, T 


r W 


r Y 


r v 


N270A, 


, D 


t R 


f c 


r E 


rQ 


r G 


f H ( 


, I 


, L 


f K 


f M ( 


r F, 


, P 


r S 


, T 


, w 


, Y 


r V 



V 
V 
V 
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D271A, 


R, 


N, 


c, 


E,Q, 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T, 


w, 


Y, 


V; 


L272A, 


D, 


R f 


N, 


C, E, 


Q, 


G, 


H, 


I, 


K, 


M, 


F, 


P, 


s, 


T, 


w, 


Y, 


V; 


G273A, 


D, 


R, 


N, 


C, E, 


Q, 


H, 


I, 


L, 


K, 


M, 




P, 


s, 


T, 


w, 


Y, 


V; 


A274D, 


R, 


N, 


c, 


E,Q, 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


T, 


w, 


Y, 


V; 


L275A, 


D, 


R, 


N, 


C,E, 


Q, 


G, 


H, 


I, 


K, 


M, 


F, 


P, 


s, 


T, 


w, 


Y, 


V; 


G456A, 


D, 


R/ 


N , 


C,E, 


Q, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s # 


T, 


. w, 


Y, 


. V; 


N457A, 


D, 


R/ 


c, 


• E, Q, 


G, 


H, 


I, 


L, 


K, 


M, 


F, 


P, 


s, 


. T , 


. w, 


Y, 


> V; 


K458A, 


D, 


R; 


N, 


r C ^ E ; 


• Q, 


- G, 


H, 


- I, 


r L, 


,M, 


F, 


P, 


s, 


' Tj 


, W, 


Y, 


» V; 


P459A, 


D, 


R, 


N t 


> C, E, 




- G, 


H, 


- I, 


r L, 


r K, 


M, 


F, 


s, 


r T\ 


J, 


r Y t 


r V; 


G460A, 


• D, 


- R, 


,N, 


rC^E, 




-H, 


I, 


r L j 




, M, 


F, 


- P, 


s, 


r T, 


r W, 


, Y i 


r V; 


T461A, 


, D, 


, R, 




r C , E , 




r G, 


H, 


r I J 


,L, 




, M, 


- F, 


• p, 


, S 




r Y t 


r V; 


V462A, 


> D, 


r Ri 


,N 


r C, E, 


rQ, 




H, 


r I , 




r K, 


rM, 


' F, 


> Pi 


, S 






r Y; 


T463A, 


, D 


r Ri 


r N 


f C, E 


rQ 


rG, 


H, 


, I 


f L 




, M, 


> F, 


r P 


, S 


f w 1 


r Y 


r V. 



15 15. The variant according to claim 14, wherein the variant has 
one or more of the following substitutions or deletions: 
Q174*, D183*, G184*, N195F, K269S. 

16. The variant according to claims 13-15, wherein the parent 

20 Termamyl-like a-amylase is the B. licheniformis a-amylase having 
the sequence shown in SEQ ID NO: 4. 

17. A DNA construct comprising a DNA sequence encoding an a- 
amylase variant according to any one of claims 1-16. 

25 

18. A recombinant expression vector which carries a DNA con- 
struct according to claim 17. 

19. A cell which is transformed with a DNA construct according 
30 to claim 17 or a vector according to claim 18. 

20. A cell according to claim 19, which is a microorganism. 

21. A cell according to claim 20, which is a bacterium or a 
35 fungus . 



22. The cell according to claim 21, which is a Gram positive 
bacterium such as Bacillus subtilis , Bacillus licheniformis , 
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Bacillus lentus, Bacillus brevis, Bacillus stearothermophilus , 
Bacillus alkalophilus , Bacillus amyloliquefaciens, Bacillus 
coagulans , Bacillus circulans , Bacillus lautus or Bacillus thu- 
ringiensis . 

23. Use of an a-amylase variant according to any one of claims 
1-16 for washing and/or dishwashing. 

24. A detergent additive comprising an a-amylase variant accor- 
ding to any one of claims 1-16, optionally in the form of a non- 
dusting granulate, stabilized liquid or protected enzyme. 

25. A detergent additive according to claim 24 which contains 
0.02-200 mg of enzyme protein/g of the additive. 

26. A detergent additive according to claims 24 or 25, which 
additionally comprises another enzyme such as a protease, a 
lipase, a peroxidase, another amylolytic enzyme and/or a 
cellulase . 

27. A detergent composition comprising an a-amylase variant ac- 
cording to any of claims 1-16. 

28. A detergent composition according to claim 27 which addi- 
tionally comprises another enzyme such as a protease, a lipase, 
a peroxidase, another amylolytic enzyme and/or a cellulase. 

29. A manual or automatic dishwashing detergent composition 

comprising an a-amylase variant according to any one of claims 
1-16. 

30. A dishwashing detergent composition according to claim 29 
which additionally comprises another enzyme such as a protease, 
a lipase, a peroxidase, another amylolytic enzyme and/or a 
cellulase . 



31. A manual or automatic laundry washing composition comprising 
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an a-amylase variant according to any one of claims 1-16. 

32. A laundry washing composition according to claim 31, which 
additionally comprises another enzyme such as a protease, a 
lipase, a peroxidase, an amylolytic enzyme and/or a cellulase. 

33. Method for providing a-amylases with 

1) altered pH optimum, and/or 

2) altered temperature optimum, and/or 

3) improved stability, 
comprising the following steps: 

i) identifying (a) target position (s) and/or region (s) for 
mutation of the a-amylase by comparing the molecular dynamics of 
two or more a-amylase' s 3D structures having substantially 
different pH, temperature and/or stability profiles, 

ii) substituting, adding and/or deleting one or more amino acids 
in the identified position(s) and/or region(s). 

34. The method according to claim 33, wherein a medium 
temperature a-amylase is compared with a high temperature a- 
amylase. 

35. The method according to claim 33, wherein a low temperature 
a-amylase is compared with a medium or high temperature a- 
amylase . 

36. The method according to claims 33-35, wherein the a-amylases 
are at least 70%, preferably 80%, up to 90%, such as up to 95%, 
especially 95% homologous. 

37. The method according to claim 36, wherein the a-amylases 
compared are Termamyl-like a-amylases. 

38. The method according to claim 28, wherein the a-amylases 
compared are any of the a-amylases shown in SEQ ID NO: 1 to SEQ 
ID NO: 8. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

5 (A) NAME : NOVO NORDISK A/S 

(B) STREET: Novo Alle 

(C) CITY: DK-2880 Bagsvaerd 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP) : DK-2880 
10 (G) TELEPHONE: +45 44 44 88 88 

(H) TELEFAX: +45 44 49 32 56 
(ii) TITLE OF INVENTION: a-amylase mutants 
(iii) NUMBER OF SEQUENCES: 4 6 
(iv) COMPUTER READABLE FORM: 
15 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 

20 (2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 
30 1 5 10 15 

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala 

20 25 30 

35 Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp lie Pro Pro Ala Trp 

35 40 45 



40 



55 



Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 



Thr Arg Asn Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 
45 85 90 95 

lie Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 

100 105 110 

50 Gly Thr Glu lie Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn 

115 120 125 



Gin Glu Thr Ser Gly Glu Tyr Ala lie Glu Ala Trp Thr Lys Phe Asp 
130 135 140 

Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr 

145 150 155 160 
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His Phe Asp Gly Thr Asp Trp Asp Gin Ser Arg Gin Leu Gin Asn Lys 

165 170 175 

lie Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp 

180 . -185 190 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
195 200 205 

Asp His Pro Glu Val lie His Glu Leu Arg Asn Trp Gly Val Trp Tyr 
210 215 220 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg lie Asp Ala Val Lys His 
225 230 235 240 

lie Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr 

245 250 255 

Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 

260 265 270 

Gly Ala lie Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val 
275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
290 295 300 

Gly Tyr Tyr Asp Met Arg Asn lie Leu Asn Gly Ser Val Val Gin Lys 
305 310 315 320 

His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 

325 330 335 

Gly Glu Ala Leu Glu Ser Phe Val Gin Gin Trp Phe Lys Pro Leu Ala 

340 345 350 

Tyr Ala Leu Val Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
355 360 365 

Gly Asp Tyr Tyr Gly lie Pro Thr His Gly Val Pro Ala Met Lys Ser 
370 375 380 

Lys lie Asp Pro Leu Leu Gin Ala Arg Gin Thr Phe Ala Tyr Gly Thr 
385 390 395 400 

Gin His Asp Tyr Phe Asp His His Asp lie He Gly Trp Thr Arg Glu 

405 410 415 

Gly Asn Ser Ser His Pro Asn Ser Gly Leu Ala Thr He Met Ser Asp 

420 425 430 

Gly Pro Gly Gly Asn Lys Trp Met Tyr Val Gly Lys Asn Lys Ala Gly 
435 440 445 



Gin Val Trp Arg Asp He Thr Gly Asn Arg Thr Gly Thr Val Thr He 
450 455 460 
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Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
465 470 475 480 



Val Trp Val Lys Gin 

485 



INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp His 
15 10 15 

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ser 

20 25 30 



Asn Leu Arg Asn 
35 

Lys Gly Thr Ser 
50 

Asp Leu Gly Glu 
65 

Thr Arg Ser Gin 



Val Gin Val Tyr 

100 

Ala Thr Glu Asn 
115 

Gin Glu lie Ser 
130 

Phe Pro Gly Arg 
145 

His Phe Asp Gly 



lie Tyr Lys Phe 

180 

Ser Glu Asn Gly 
195 

Asp His Pro Glu 
210 



Arg Gly lie Thr 

40 

Gin Asn Asp Val 
55 

Phe Asn Gin Lys 
70 

Leu Glu Ser Ala 
85 

Gly Asp Val Val 



Val Leu Ala Val 

120 

Gly Asp Tyr Thr 
135 

Gly Asn Thr Tyr 
150 

Val Asp Trp Asp 
165 

Arg Gly Asp Gly 



Asn Tyr Asp Tyr 

200 

Val Val Asn Glu 
215 



Ala lie Trp lie 



Gly Tyr Gly Ala 

60 

Gly Thr Val Arg 
75 

lie His Ala Leu 

90 

Met Asn His Lys 
105 

Glu Val Asn Pro 



lie Glu Ala Trp 

140 

Ser Asp Phe Lys 
155 

Gin Ser Arg Gin 
170 

Lys Ala Trp Asp 
185 

Leu Met Tyr Ala 



Leu Arg Arg Trp 

220 



Pro Pro Ala Trp 
45 

Tyr Asp Leu Tyr 



Thr Lys Tyr Gly 

80 

Lys Asn Asn Gly 
95 

Gly Gly Ala Asp 
110 

Asn Asn Arg Asn 
125 

Thr Lys Phe Asp 



Trp Arg Trp Tyr 

160 

Phe Gin Asn Arg 
175 

Trp Glu Val Asp 
190 

Asp Val Asp Met 
205 

Gly Glu Trp Tyr 



Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg lie Asp Ala Val Lys His 
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225 

lie Lys Tyr Ser 



Thr Gly Lys Glu 

260 

Gly Ala Leu Glu 
275 

Phe Asp Val Pro 
290 

Gly Asn Tyr Asp 
305 

His Pro Met His 



Gly Glu Ser Leu 

340 

Tyr Ala Leu lie 
355 

Gly Asp Tyr Tyr 
370 

Lys lie Asp Pro 
385 

Gin His Asp Tyr 



Gly Asn Thr Thr 

420 

Gly Pro Gly Gly 
435 

Gin Val Trp His 
450 

Asn Ala Asp Gly 
465 



230 

Phe Thr Arg Asp 
245 

Met Phe Ala Val 



Asn Tyr Leu Asn 

280 

Leu His Tyr Asn 
295 

Met Ala Lys Leu 
310 

Ala Val Thr Phe 
325 

Glu Ser Phe Val 



Leu Thr Arg Glu 

360 

Gly lie Pro Thr 
375 

lie Leu Glu Ala 
390 

Phe Asp His His 
405 

His Pro Asn Ser 



Glu Lys Trp Met 

440 

Asp lie Thr Gly 
455 

Trp Ala Asn Phe 
470 



4 

235 

Trp Leu Thr His 
250 

Ala Glu Phe Trp 
265 

Lys Thr Asn Trp 



Leu Tyr Asn Ala 

300 

Leu Asn Gly Thr 
315 

Val Asp Asn His 
330 

Gin Glu Trp Phe 
345 

Gin Gly Tyr Pro 



His Ser Val Pro 

380 

Arg Gin Asn Phe 

395 

Asn lie lie Gly 
410 

Gly Leu Ala Thr 
425 

Tyr Val Gly Gin 



Asn Lys Pro Gly 

460 

Ser Val Asn Gly 
475 



240 

Val Arg Asn Ala 
255 

Lys Asn Asp Leu 
270 

Asn His Ser Val 
285 

Ser Asn Ser Gly 



Val Val Gin Lys 

320 

Asp Ser Gin Pro 
335 

Lys Pro Leu Ala 
350 

Ser Val Phe Tyr 
365 

Ala Met Lys Ala 



Ala Tyr Gly Thr 

400 

Trp Thr Arg Glu 
415 

lie Met Ser Asp 
430 

Asn Lys Ala Gly 
445 

Thr Val Thr lie 



Gly Ser Val Ser 

480 



lie Trp Val Lys Arg 

485 



(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



WO 99/23211 



PCT/DK98/00471 



Ala Ala Pro Phe 
1 

Pro Asp Asp Gly 

20 

Leu Ser Ser Leu 
35 

Gly Thr Ser Arg 
50 

Leu Gly Glu Phe 
65 

Lys Ala Gin Tyr 



Gin Val Tyr Ala 

100 

Thr Glu Trp Val 
115 

Glu lie Ser Gly 
130 

Pro Gly Arg Gly 
145 

Phe Asp Gly Val 



Lys Phe Arg Gly 

180 

Asn Gly Asn Tyr 
195 

Pro Glu Val Val 
210 

Thr Thr Asn lie 
225 

Phe Ser Phe Phe 



Lys Pro Leu Phe 

260 

Leu His Asn Tyr 
275 



Asn Gly Thr Met 
5 

Thr Leu Trp Thr 



Gly lie Thr Ala 

40 

Ser Asp Val Gly 
55 

Asn Gin Lys Gly 
70 

Leu Gin Ala lie 
85 

Asp Val Val Phe 



Asp Ala Val Glu 

120 

Thr Tyr Gin lie 
135 

Asn Thr Tyr Ser 
150 

Asp Trp Asp Glu 
165 

lie Gly Lys Ala 



Asp Tyr Leu Met 

200 

Thr Glu Leu Lys 
215 

Asp Gly Phe Arg 
230 

Pro Asp Trp Leu 
245 

Thr Val Gly Glu 



lie Met Lys Thr 

280 



5 

Met Gin Tyr Phe 
10 

Lys Val Ala Asn 
25 

Leu Trp Leu Pro 



Tyr Gly Val Tyr 

60 

Ala Val Arg Thr 
75 

Gin Ala Ala His 
90 

Asp His Lys Gly 
105 

Val Asn Pro Ser 



Gin Ala Trp Thr 

140 

Ser Phe Lys Trp 
155 

Ser Arg Lys Leu 
170 

Trp Asp Trp Glu 
185 

Tyr Ala Asp Leu 



Ser Trp Gly Lys 

220 

Leu Asp Ala Val 
235 

Ser Asp Val Arg 
250 

Tyr Trp Ser Tyr 
265 

Asn Gly Thr Met 



Glu Trp Tyr Leu 
15 

Glu Ala Asn Asn 
30 

Pro Ala Tyr Lys 
45 

Asp Leu Tyr Asp 



Lys Tyr Gly Thr 

80 

Ala Ala Gly Met 
95 

Gly Ala Asp Gly 
110 

Asp Arg Asn Gin 
125 

Lys Phe Asp Phe 



Arg Trp Tyr His 

160 

Ser Arg lie Tyr 
175 

Val Asp Thr Glu 
190 

Asp Met Asp His 
205 

Trp Tyr Val Asn 



Lys His lie Lys 

240 

Ser Gin Thr Gly 
255 

Asp lie Asn Lys 
270 

Ser Leu Phe Asp 
285 



Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Thr 
290 295 300 
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Phe Asp Met Arg 
305 

Thr Leu Ala Val 



Ala Leu Gin Ser 

340 

Phe lie Leu Thr 
355 

Tyr Tyr Gly lie 
370 

Asp Pro Leu Leu 
385 

Asp Tyr Leu Asp 



Thr Glu Lys Pro 

420 

Gly Gly Ser Lys 
435 

Phe Tyr Asp Leu 
450 

Asp Gly Trp Gly 
465 

Val Pro Arg Lys 



Arg Pro Trp Thr 

500 

Ala Trp 



Thr Leu Met Thr 
310 

Thr Phe Val Asp 
325 

Trp Val Asp Pro 



Arg Gin Glu Gly 

360 

Pro Gin Tyr Asn 
375 

lie Ala Arg Arg 
390 

His Ser Asp lie 
405 

Gly Ser Gly Leu 



Trp Met Tyr Val 

440 

Thr Gly Asn Arg 
455 

Glu Phe Lys Val 
470 

Thr Thr Val Ser 
485 

Asp Glu Phe Val 



6 

Asn Thr Leu Met 
315 

Asn His Asp Thr 
330 

Trp Phe Lys Pro 
345 

Tyr Pro Cys Val 



lie Pro Ser Leu 

380 

Asp Tyr Ala Tyr 
395 

lie Gly Trp Thr 
410 

Ala Ala Leu lie 
425 

Gly Lys Gin His 



Ser Asp Thr Val 

460 

Asn Gly Gly Ser 
475 

Thr lie Ala Trp 
490 

Arg Trp Thr Glu 
505 



Lys Asp Gin Pro 

320 

Glu Pro Gly Gin 
335 

Leu Ala Tyr Ala 
350 

Phe Tyr Gly Asp 
365 

Lys Ser Lys lie 



Gly Thr Gin His 

400 

Arg Glu Gly Val 
415 

Thr Asp Gly Pro 
430 

Ala Gly Lys Val 
445 

Thr lie Asn Ser 



Val Ser Val Trp 

480 

Ser lie Thr Thr 
495 

Pro Arg Leu Val 
510 



(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ala Asn Leu Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Met Pro 
15 10 15 

Asn Asp Gly Gin His Trp Arg Arg Leu Gin Asn Asp Ser Ala Tyr Leu 

20 25 30 

Ala Glu His Gly He Thr Ala Val Trp He Pro Pro Ala Tyr Lys Gly 
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Thr Ser Gin Ala 
50 

Gly Glu Phe His 
65 

Gly Glu Leu Gin 



Val Tyr Gly Asp 

100 

Glu Asp Val Thr 
115 

lie Ser Gly Glu 
130 

Gly Arg Gly Ser 
145 

Asp Gly Thr Asp 



Phe Gin Gly Lys 

180 

Tyr Asp Tyr Leu 
195 

Ala Ala Glu He 
210 

Leu Asp Gly Phe 
225 

Leu Arg Asp Trp 



Phe Thr Val Ala 

260 

Tyr Leu Asn Lys 
275 

His Tyr Gin Phe 
290 

Arg Lys Leu Leu 
305 

Val Thr Phe Val 



Ser Thr Val Gin 

340 



Asp Val Gly Tyr 
55 

Gin Lys Gly Thr 
70 

Ser Ala He Lys 
85 

Val Val He Asn 



Ala Val Glu Val 

120 

His Leu He Lys 

135 . 

Thr Tyr Ser Asp 
150 

Trp Asp Glu Ser 
165 

Ala Trp Asp Trp 



Met Tyr Ala Asp 

200 

Lys Arg Trp Gly 
215 

Arg Leu Asp Ala 
230 

Val Asn His Val 
245 

Glu Tyr Trp Gin 



Thr Asn Phe Asn 

280 

His Ala Ala Ser 
295 

Asn Gly Thr Val 
310 

Asp Asn His Asp 
325 

Thr Trp Phe Lys 



7 



Gly Ala Tyr Asp 

60 

Val Arg Thr Lys 
75 

Ser Leu His Ser 
90 

His Lys Gly Gly 
105 

Asp Pro Ala Asp 



Ala Trp Thr His 

140 

Phe Lys Trp His 
155 

Arg Lys Leu Asn 
170 

Glu Val Ser Asn 
185 

He Asp Tyr Asp 



Thr Trp Tyr Ala 

220 

Val Lys His lie 
235 

Arg Glu Lys Thr 
250 

Asn Asp Leu Gly 
265 

His Ser Val Phe 



Thr Gin Gly Gly 

300 

Val Ser Lys His 
315 

Thr Gin Pro Gly 
330 

Pro Leu Ala Tyr 
345 



45 

Leu Tyr Asp Leu 



Tyr Gly Thr Lys 

80 

Arg Asp He Asn 
95 

Ala Asp Ala Thr 
110 

Arg Asn Arg Val 
125 

Phe His Phe Pro 



Trp Tyr His Phe 

160 

Arg He Tyr Lys 
175 

Glu Asn Gly Asn 
190 

His Pro Asp Val 
205 

Asn Glu Leu Gin 



Lys Phe Ser Phe 

240 

Gly Lys Glu Met 
255 

Ala Leu Glu Asn 
270 

Asp Val Pro Leu 
285 

Gly Tyr Asp Met 



Pro Leu Lys Ser 

320 

Gin Ser Leu Glu 
335 

Ala Phe He Leu 
350 



WO 99/23211 



PCT/DK98/00471 



Thr Arg Glu Ser 
355 

Thr Lys Gly Asp 
370 

Glu Pro lie Leu 
385 

Asp Tyr Phe Asp 



Ser Ser Val Ala 

420 

Gly Gly Ala Lys 
435 

Trp His Asp lie 
450 

Glu Gly Trp Gly 
465 



Gly Tyr Pro Gin 

360 

Ser Gin Arg Glu 
375 

Lys Ala Arg Lys 
390 

His His Asp lie 
405 

Asn Ser Gly Leu 



Arg Met Tyr Val 

440 

Thr Gly Asn Arg 
455 

Glu Phe His Val 
470 



8 

Val Phe Tyr Gly 



lie Pro Ala Leu 

380 

Gin Tyr Ala Tyr 
395 

Val Gly Trp Thr 
410 

Ala Ala Leu lie 
425 

Gly Arg Gin Asn 



Ser Glu Pro Val 

460 

Asn Gly Gly Ser 
475 



Asp Met Tyr Gly 
365 

Lys His Lys lie 



Gly Ala Gin His 

400 

Arg Glu Gly Asp 
415 

Thr Asp Gly Pro 
430 

Ala Gly Glu Thr 
445 

Val lie Asn Ser 



Val Ser lie Tyr 

480 



Val Gin Arg 



2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Val Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Thr Pro Asn Asp 
15 10 15 

Gly Gin His Trp Lys Arg Leu Gin Asn Asp Ala Glu His Leu Ser Asp 

20 25 30 

lie Gly lie Thr Ala Val Trp lie Pro Pro Ala Tyr Lys Gly Leu Ser 

35 40 45 

Gin Ser Asp Asn Gly Tyr Gly Pro Tyr Asp Leu Tyr Asp Leu Gly Glu 
50 55 60 

Phe Gin Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ser Glu 
65 70 75 80 

Leu Gin Asp Ala lie Gly Ser Leu His Ser Arg Asn Val Gin Val Tyr 

85 90 95 



Gly Asp Val Val Leu Asn His Lys Ala Gly Ala Asp Ala Thr Glu Asp 

100 105 110 
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Val Thr Ala Val Glu Val Asn Pro Ala Asn Arg Asn Gin Glu Thr Ser 
115 120 125 

Glu Glu Tyr Gin lie Lys Ala Trp Thr Asp Phe Arg Phe Pro Gly Arg 
130 135 140 

Gly Asn Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe Asp Gly 
145 150 155 160 

Ala Asp Trp Asp Glu Ser Arg Lys lie Ser Arg lie Phe Lys Phe Arg 

165 170 175 

Gly Glu Gly Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn Gly Asn 

180 185 190 

Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Tyr Asp His Pro Asp Val 
195 200 205 

Val Ala Glu Thr Lys Lys Trp Gly lie Trp Tyr Ala Asn Glu Leu Ser 
210 215 220 

Leu Asp Gly Phe Arg lie Asp Ala Ala Lys His lie Lys Phe Ser Phe 
225 230 235 240 

Leu Arg Asp Trp Val Gin Ala Val Arg Gin Ala Thr Gly Lys Glu Met 

245 250 255 

Phe Thr Val Ala Glu Tyr Trp Gin Asn Asn Ala Gly Lys Leu Glu Asn 

260 265 270 

Tyr Leu Asn Lys Thr Ser Phe Asn Gin Ser Val Phe Asp Val Pro Leu 
275 280 285 

His Phe Asn Leu Gin Ala Ala Ser Ser Gin Gly Gly Gly Tyr Asp Met 
290 295 300 

Arg Arg Leu Leu Asp Gly Thr Val Val Ser Arg His Pro Glu Lys Ala 
305 310 315 320 

Val Thr Phe Val Glu Asn His Asp Thr Gin Pro Gly Gin Ser Leu Glu 

325 330 335 

Ser Thr Val Gin Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe lie Leu 

340 345 350 

Thr Arg Glu Ser Gly Tyr Pro Gin Val Phe Tyr Gly Asp Met Tyr Gly 
355 360 365 

Thr Lys Gly Thr Ser Pro Lys Glu lie Pro Ser Leu Lys Asp Asn lie 
370 375 380 

Glu Pro lie Leu Lys Ala Arg Lys Glu Tyr Ala Tyr Gly Pro Gin His 
385 390 395 400 

Asp Tyr lie Asp His Pro Asp Val lie Gly Trp Thr Arg Glu Gly Asp 

405 410 415 



Ser Ser Ala Ala Lys Ser Gly Leu Ala Ala Leu lie Thr Asp Gly Pro 
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420 425 

Gly Gly Ser Lys Arg Met Tyr Ala Gly 
435 440 

Trp Tyr Asp lie Thr Gly Asn Arg Ser 
450 455 

Asp Gly Trp Gly Glu Phe His Val Asn 
465 470 



PCT/DK98/00471 

430 

Leu Lys Asn Ala Gly Glu Thr 

445 

Asp Thr Val Lys lie Gly Ser 
460 

Asp Gly Ser Val Ser lie Tyr 
475 480 



INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 
1 ~ 5 10 15 



Leu Pro Asn Asp 

20 

Asn Leu Lys Ser 

35 

Lys Gly Ala Ser 
50 

Asp Leu Gly Glu 
65 

Thr Arg Ser Gin 



lie Gin Val Tyr 

100 

Ala Thr Glu Met 
115 

Gin Glu Val Thr 
130 

Phe Pro Gly Arg 
145 

His Phe Asp Gly 



Gly Asn His Trp 



Lys Gly lie Thr 

40 

Gin Asn Asp Val 
55 

Phe Asn Gin Lys 

70 

Leu Gin Ala Ala 
85 

Gly Asp Val Val 



Val Arg Ala Val 

120 

Gly Glu Tyr Thr 
135 

Gly Asn Thr His 
150 

Val Asp Trp Asp 
165 



Asn Arg Leu Asn 
25 

Ala Val Trp He 



Gly Tyr Gly Ala 

60 

Gly Thr Val Arg 
75 

Val Thr Ser Leu 

90 

Met Asn His Lys 
105 

Glu Val Asn Pro 



He Glu Ala Trp 

140 

Ser Ser Phe Lys 
155 

Gin Ser Arg Arg 
170 



Ser Asp Ala Ser 
30 

Pro Pro Ala Trp 

45 

Tyr Asp Leu Tyr 



Thr Lys Tyr Gly 

80 

Lys Asn Asn Gly 
95 

Gly Gly Ala Asp 
110 

Asn Asn Arg Asn 
125 

Thr Arg Phe Asp 



Trp Arg Trp Tyr 

160 

Leu Asn Asn Arg 
175 



He Tyr Lys Phe Arg Gly His Gly Lys Ala Trp Asp Trp Glu Val Asp 

180 185 190 
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Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp lie Asp Met 
195 200 205 

Asp His Pro Glu Val Val Asn Glu Leu Arg Asn Trp Gly Val Trp Tyr 
210 215 220 

Thr Asn Thr Leu Gly Leu Asp Gly Phe Arg lie Asp Ala Val Lys His 
225 230 235 240 

lie Lys Tyr Ser Phe Thr Arg Asp Trp lie Asn His Val Arg Ser Ala 

245 250 255 

Thr Gly Lys Asn Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 

260 265 270 

Gly Ala lie Glu Asn Tyr Leu Gin Lys Thr Asn Trp Asn His Ser Val 
275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Lys Ser Gly 
290 295 300 

Gly Asn Tyr Asp Met Arg Asn lie Phe Asn Gly Thr Val Val Gin Arg 
305 310 315 320 

His Pro Ser His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 

325 330 335 

Glu Glu Ala Leu Glu Ser Phe Val Glu Glu Trp Phe Lys Pro Leu Ala 

340 345 350 

Tyr Ala Leu Thr Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
355 360 365 

Gly Asp Tyr Tyr Gly lie Pro Thr His Gly Val Pro Ala Met Arg Ser 
370 375 380 

i_,ys lie Asp Pro lie Leu Glu Ala Arg Gin Lys Tyr Ala Tyr Gly Lys 
385 390 395 400 

Gin Asn Asp Tyr Leu Asp His His Asn lie lie Gly Trp Thr Arg Glu 

405 410 415 

Gly Asn Thr Ala His Pro Asn Ser Gly Leu Ala Thr lie Met Ser Asp 

420 425 430 

Gly Ala Gly Gly Ser Lys Trp Met Phe Val Gly Arg Asn Lys Ala Gly 
435 440 445 

Gin Val Trp Ser Asp lie Thr Gly Asn Arg Thr Gly Thr Val Thr lie 
450 455 460 

Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
465 470 475 480 



lie Trp Val Asn Lys 

4 85 
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INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr 
15 10 15 

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala 

20 25 30 

Asn Leu Lys Ser Lys Gly lie Thr Ala Val Trp lie. Pro Pro Ala Trp 
35 40 45 

Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 

Thr Arg Asn Gin Leu Gin Ala Ala Val Thr Ser Leu Lys Asn Asn Gly 

85 90 95 

lie Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 

100 105 110 

Gly Thr Glu lie Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn 
115 120 125 

Gin Glu Thr Ser Gly Glu Tyr Ala lie Glu Ala Trp Thr Lys Phe Asp 
130 135 140 

Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr 
145 150 155 160 

His Phe Asp Gly Thr Asp Trp Asp Gin Ser Arg Gin Leu Gin Asn Lys 

165 170 175 

lie Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp 

180 185 190 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
195 200 205 

Asp His Pro Glu Val lie His Glu Leu Arg Asn Trp Gly Val Trp Tyr 
210 215 220 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg lie Asp Ala Val Lys His 
225 230 235 240 



lie Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr 

245 250 255 
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Thr Gly Lys Pro 

260 

Gly Ala lie Glu 
275 

Phe Asp Val Pro 
290 

Gly Tyr Tyr Asp 
305 

His Pro Thr His 



Gly Glu Ala Leu 

340 

Tyr Ala Leu Val 
355 

Gly Asp Tyr Tyr 
370 

Lys lie Asp Pro 
385 

Gin His Asp Tyr 



Gly Asn Ser Ser 

420 

Gly Pro Gly Gly 
435 

Gin Val Trp Arg 
450 

Asn Ala Asp Gly 
465 



Met Phe Ala Val 



Asn Tyr Leu Asn 

280 

Leu His Tyr Asn 

295 

Met Arg Asn lie 
310 

Ala Val Thr Phe 
325 

Glu Ser Phe Val 



Leu Thr Arg Glu 

360 

Gly lie Pro Thr 
375 

Leu Leu Gin Ala 
390 

Phe Asp His His 
405 

His Pro Asn Ser 



Asn Lys Trp Met 

440 

Asp lie Thr Gly 
455 

Trp Gly Asn Phe 
470 
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Ala Glu Phe Trp 
265 

Lys Thr Ser Trp 



Leu Tyr Asn Ala 

300 

Leu Asn Gly Ser 
315 

Val Asp Asn His 
330 

Gin Gin Trp Phe 
345 

Gin Gly Tyr Pro 



His Gly Val Pro 

380 

Arg Gin Thr Phe 
395 

Asp lie lie Gly 
410 

Gly Leu Ala Thr 
425 

Tyr Val Gly Lys 



Asn Arg Thr Gly 

460 

Ser Val Asn Gly 
475 



Lys Asn Asp Leu 
270 

Asn His Ser Val 
285 

Ser Asn Ser Gly 



Val Val Gin Lys 

320 

Asp Ser Gin Pro 
335 

Lys Pro Leu Ala 
350 

Ser Val Phe Tyr 
365 

Ala Met Lys Ser 



Ala Tyr Gly Thr 

400 

Trp Thr Arg Glu 
415 

lie Met Ser Asp 
430 

Asn Lys Ala Gly 
445 

Thr Val Thr lie 



Gly Ser Val Ser 

480 



Val Trp Val Lys Gin 

485 



(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



His His Asn Gly Thr Asn Gly Thr Met Met Gin Tyr Phe Glu Trp His 
15 10 15 

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ser 
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20 25 30 

Asn Leu Arg Asn Arg Gly lie Thr Ala He Trp He Pro Pro Ala Trp 
35 40 45 

Lys Gly Thr Ser Gin Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr 
50 55 60 

Asp Leu Gly Glu Phe Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly 
65 70 75 80 

Thr Arg Ser Gin Leu Glu Ser Ala He His Ala Leu Lys Asn Asn Gly 

85 90 95 

Val Gin Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp 

100 105 HO 

Ala Thr Glu Asn Val Leu Ala Val Glu Val Asn Pro Asn Asn Arg Asn 
115 120 125 

Gin Glu He Ser Gly Asp Tyr Thr He Glu Ala Trp Thr Lys Phe Asp 
130 135 140 

Phe Pro Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp Arg Trp Tyr 
145 150 155 160 

His Phe Asp Gly Val Asp Trp Asp Gin Ser Arg Gin Phe Gin Asn Arg 

165 170 175 

He Tyr Lys Phe Arg Gly Asp Gly Lys Ala Trp Asp Trp Glu Val Asp 

180 185 190 

Ser Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
195 200 205 

Asp His Pro Glu Val Val Asn Glu Leu Arg Arg Trp Gly Glu Trp Tyr 
210 215 220 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg lie Asp Ala Val Lys His 
225 230 235 240 

He Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Ala 

245 250 255 

Thr Gly Lys Glu Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 

260 265 270 

Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Trp Asn His Ser Val 
275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
290 295 300 

Gly Asn Tyr Asp Met Ala Lys Leu Leu Asn Gly Thr Val Val Gin Lys 
305 310 315 320 



His Pro Met His Ala Val Thr Phe Val Asp Asn His Asp Ser Gin Pro 

325 330 335 



WO 99/23211 



PCT/DK98/00471 



15 

Gly Glu Ser Leu Glu Ser Phe Val Gin Glu Trp Phe Lys Pro Leu Ala 

340 345 350 

Tyr Ala Leu lie Leu Thr Arg Glu Gin Gly Tyr Pro Ser Val Phe Tyr 
355 360 365 

Gly Asp Tyr Tyr Gly lie Pro Thr His Ser Val Pro Ala Met Lys Ala 
370 375 380 

Lys lie Asp Pro lie Leu Glu Ala Arg Gin Asn Phe Ala Tyr Gly Thr 
385 390 395 400 

Gin His Asp Tyr Phe Asp His His Asn lie lie Gly Trp Thr Arg Glu 

405 410 415 

Gly Asn Thr Thr His Pro Asn Ser Gly Leu Ala Thr lie Met Ser Asp 

420 425 430 

Gly Pro Gly Gly Glu Lys Trp Met Tyr Val Gly Gin Asn Lys Ala Gly 
435 440 445 

Gin Val Trp His Asp lie Thr Gly Asn Lys Pro Gly Thr Val Thr lie 
450 455 460 

Asn Ala Asp Gly Trp Ala Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
465 470 475 480 

lie Trp Val Lys Arg 

485 

(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CATCATAATG GAACAAATGG TACTATGATG CAATATTTCG AATGGTATTT GCCAAATGAC 60 

GGGAATCATT GGAACAGGTT GAG G GAT G AC GCAGCTAACT TAAAGAGTAA AGGGATAACA 120 

GCTGTATGGA TCCCACCTGC ATGGAAGGGG ACTTCCCAGA ATGATGTAGG TTATGGAGCC 180 

TATGATTTAT ATGATCTTGG AGAGTTTAAC CAGAAGGGGA CGGTTCGTAC AAAATATGGA 24 0 

ACACGCAACC AGCTACAGGC TGCGGTGACC TCTTTAAAAA ATAACGGCAT TCAGGTATAT 300 

GGTGATGTCG TCATGAATCA TAAAGGTGGA GCAGATGGTA CGGAAATTGT AAATGCGGTA 360 

GAAGTGAATC GGAGCAACCG AAACCAGGAA ACCTCAGGAG AGTATGCAAT AGAAGCGTGG 4 20 

ACAAAGTTTG ATTTTCCTGG AAG AG G AAAT AACCATTCCA GCTTTAAGTG GCGCTGGTAT 4 80 

CATTTTGATG GGACAGATTG GGATCAGTCA CGCCAGCTTC AAAACAAAAT AT AT AAAT T C 54 0 
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AGGGGAACAG GCAAGGCCTG GGACTGGGAA GTCGATACAG AGAATGGCAA C TAT G AC TAT 600 

CTTATGTATG CAGACGTGGA TATGGATCAC CCAGAAGTAA TACATGAACT TAGAAACTGG 660 

5 GGAGTGTGGT ATACGAATAC ACTGAACCTT GATGGATTTA GAATAGATGC AG T G AAAC AT 720 

ATAAAATATA GCTTTACGAG AGATTGGCTT ACACATGTGC GTAACACCAC AGGTAAACCA 7 80 
ATGTTTGCAG TGGCTGAGTT TTGGAAAAAT GACCTTGGTG CAATTGAAAA CTATTTGAAT 8 40 

10 

AAAACAAGTT GGAATCACTC GGTGTTTGAT GTTCCTCTCC ACTATAATTT GTACAATGCA 900 
TCTAATAGCG GTGGTTATTA T GAT AT GAGA AATATTTTAA ATGGTTCTGT GGTGCAAAAA 960 

15 CATCCAACAC ATGCCGTTAC TTTTGTTGAT AACCATGATT CTCAGCCCGG GGAAGCATTG 1020 

GAATCCTTTG TTCAACAATG GTTTAAACCA CTTGCATATG CATTGGTTCT GACAAGGGAA 108 0 

CAAGGTTATC CTTCCGTATT TTATGGGGAT TACTACGGTA TCCCAACCCA TGGTGTTCCG 114 0 

20 

GCTATGAAAT C T AAAAT AG A CCCTCTTCTG CAGGCACGTC AAACTTTTGC CTATGGTACG 1200 

CAGCATGATT ACTTTGATCA T CAT GAT ATT ATCGGTTGGA CAAGAGAGGG AAATAGCTCC 12 60 

25 CATCCAAATT CAGGCCTTGC CACCATTATG TCAGATGGTC CAGGTGGTAA CAAATGGATG 1320 

TATGTGGGGA AAAAT AAAGC GGGACAAGTT TGGAGAGATA TTACCGGAAA TAGGACAGGC 138 0 

ACCGTCACAA TTAATGCAGA CGGATGGGGT AATTTCTCTG TTAATGGAGG GTCCGTTTCG 14 4 0 

30 

GTTTGGGTGA AGCAA 14 55 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CATCATAATG GGACAAATGG GACGATGATG CAATACTTTG AATGGCACTT GCCTAATGAT 60 

GGGAATCACT GGAATAGATT AAG AG AT GAT GCTAGTAATC TAAGAAATAG AGGTATAACC 120 

45 

GCTATTTGGA TTCCGCCTGC CTGGAAAGGG ACTTCGCAAA ATGATGTGGG GTATGGAGCC 18 0 

TATGATCTTT AT GAT T TAG G GGAATTTAAT CAAAAGGGGA CGGTTCGTAC TAAGTATGGG 24 0 

50 ACACGTAGTC AATTGGAGTC TGCCATCCAT GCTTTAAAGA ATAATGGCGT TCAAGTTTAT 300 

GGGGATGTAG TGATGAACCA TAAAGGAGGA GCTGATGCTA CAGAAAACGT TCTTGCTGTC 360 

GAGGTGAATC CAAATAACCG GAATCAAGAA ATATCTGGGG AC T AC AC AAT TGAGGCTTGG 420 

55 

ACTAAGTTTG ATTTTCCAGG GAGGGGTAAT ACATACTCAG ACTTTAAATG GCGTTGGTAT 48 0 

CATTTCGATG GTGTAGATTG GGATCAATCA CGACAATTCC AAAAT CGT AT CTACAAATTC 54 0 



35 



40 
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CGAGGTGATG GTAAGGCATG GGATTGGGAA GTAGATTCGG AAAATGGAAA T TAT GAT TAT 600 

TTAATGTATG CAGATGTAGA TAT G GAT CAT CCGGAGGTAG TAAATGAGCT TAGAAGATGG 660 

GGAGAATGGT ATACAAATAC ATTAAATCTT GATGGATTTA GGATCGATGC GGTGAAGCAT 720 

ATTAAATATA GCTTTACACG TGATTGGTTG ACCCATGTAA GAAACGCAAC GGGAAAAGAA 78 0 

ATGTTTGCTG TTGCTGAATT TTGGAAAAAT GATTTAGGTG CCTTGGAGAA CTATTTAAAT 84 0 

AAAACAAACT GGAATCATTC TGTCTTTGAT GTCCCCCTTC ATTATAATCT TTATAACGCG 900 

TCAAATAGTG GAGGCAACTA TGACATGGCA AAACTTCTTA ATGGAACGGT TGTTCAAAAG 960 

CATCCAATGC ATGCCGTAAC TTTTGTGGAT AATCACGATT CTCAACCTGG G G AAT CAT T A 1020 

GAATCATTTG T AC AAGAAT G GTTTAAGCCA CTTGCTTATG CGCTTATTTT AACAAGAGAA 108 0 

CAAGGCTATC CCTCTGTCTT CTATGGTGAC TACTATGGAA TTCCAACACA TAGTGTCCCA 114 0 

GCAAT GAAAG CCAAGATTGA TCCAATCTTA GAGGCGCGTC AAAATTTTGC ATATGGAACA 1200 

CAACATGATT ATTTTGACCA T CAT AAT AT A ATCGGATGGA CACGTGAAGG AAATACCACG 12 60 

CATCCCAATT CAGGACTTGC GACTATCATG TCGGATGGGC CAGGGGGAGA GAAATGGATG 132 0 

TACGTAGGGC AAAATAAAGC AGGTCAAGTT TGGCATGACA TAACTGGAAA TAAACCAGGA 138 0 

ACAGTTACGA T C AAT G C AG A TGGATGGGCT AATTTTTCAG TAAATGGAGG ATCTGTTTCC 14 4 0 

ATTTGGGTGA AACGA 14 55 

(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GCCGCACCGT TTAACGGCAC CAT GAT GC AG TATTTTGAAT GGTACTTGCC GGATGATGGC 60 

ACGTTATGGA CCAAAGTGGC CAATGAAGCC AACAACTTAT CCAGCCTTGG CATCACCGCT 12 0 

CTTTGGCTGC CGCCCGCTTA CAAAGGAACA AGCCGCAGCG ACGTAGGGTA CGGAGTATAC 18 0 

GACTTGTATG ACCTCGGCGA AT T C AAT C AA AAAGGGACCG TCCGCACAAA ATACGGAACA 24 0 

AAAGCTCAAT ATCTTCAAGC CATTCAAGCC GCCCACGCCG CTGGAATGCA AGTGTACGCC 300 

GATGTCGTGT TCGACCATAA AGGCGGCGCT GACGGCACGG AATGGGTGGA CGCCGTCGAA 360 

GTCAATCCGT CCGACCGCAA CCAAGAAATC TCGGGCACCT ATCAAATCCA AGCATGGACG 4 20 

AAATTTGATT TTCCCGGGCG GGGCAACACC TACTCCAGCT TTAAGTGGCG CTGGTACCAT 4 80 
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TTTGACGGCG TTGATTGGGA CGAAAGCCGA AAATTGAGCC GCATTTACAA ATTCCGCGGC 540 
ATCGGCAAAG CGTGGGATTG GGAAGTAGAC ACGGAAAACG GAAACTATGA CTACTTAATG 600 
TATGCCGACC TTGATATGGA TCATCCCGAA GTCGTGACCG AGCTGAAAAA CTGGGGGAAA 660 
TGGTATGTCA AC AC AAC G AA CATTGATGGG TTCCGGCTTG ATGCCGTCAA GCATATTAAG 7 20 

TTCAGTTTTT TTCCTGATTG GTTGTCGTAT GTGCGTTCTC AGACTGGCAA GCCGCTATTT 7 80 

ACCGTCGGGG AATATTGGAG CTATGACATC AACAAGTTGC ACAATTACAT TACGAAAACA 840 
GACGGAACGA TGTCTTTGTT TGATGCCCCG TT AC AC AAC A AATTTTATAC CGCTTCCAAA 900 
TCAGGGGGCG CATTTGATAT GCGCACGTTA ATGACCAATA CTCTCATGAA AG AT C AAC C G 960 

ACATTGGCCG TCACCTTCGT TGATAATCAT GACACCGAAC CCGGCCAAGC GCTGCAGTCA 1020 

TGGGTCGACC CATGGTTCAA ACCGTTGGCT TACGCCTTTA TTCTAACTCG GCAGGAAGGA 108 0 

TACCCGTGCG TCTTTTATGG TGACTATTAT GGCATTCCAC AATATAACAT TCCTTCGCTG 114 0 

AAAAGCAAAA TCGATCCGCT CCTCATCGCG CGCAGGGATT ATGCTTACGG AACGCAACAT 12 00 

GATTATCTTG ATCACTCCGA CATCATCGGG TGGACAAGGG AAGGGGGCAC TGAAAAACCA 12 60 

GGATCCGGAC TGGCCGCACT GATCACCGAT GGGCCGGGAG GAAGCAAATG GATGTACGTT 132 0 

GGCAAACAAC ACGCTGGAAA AGTGTTCTAT GACCTTACCG GCAACCGGAG TGACACCGTC 138 0 

ACCATCAACA GTGATGGATG GGGGGAATTC AAAGTCAATG GCGGTTCGGT TTCGGTTTGG 14 4 0 

GTTCCTAGAA AAACGACCGT TTCTACCATC GCTCGGCCGA TCACAACCCG ACCGTGGACT 1500 

GGTGAATTCG TCCGTTGGAC CGAACCACGG TTGGTGGCAT GGCCTTGA 154 8 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 421 . .1872 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CGGAAGATTG GAAGTACAAA AATAAGCAAA AGATTGTCAA TCATGTCATG AGCCATGCGG 60 

GAGAC GGAAA AATCGTCTTA ATGCACGATA TTTATGCAAC GTTCGCAGAT GCTGCTGAAG 120 

AGATTATTAA AAAGCTGAAA GCAAAAGGCT ATCAATTGGT AACTGTATCT CAGCTTGAAG 18 0 

AAGTGAAGAA GCAGAGAGGC TATTGAATAA ATGAGTAGAA GCGCCATATC GGCGCTTTTC 24 0 



10 
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TTTTGGAAGA AAATATAGGG AAAATGGTAC TTGTTAAAAA TTCGGAATAT TTATACAACA 300 

TCATATGTTT C AC AT T GAAA GGGGAGGAGA AT CAT G AAAC AACAAAAACG GCTTTACGCC 360 

5 CGATTGCTGA CGCTGTTATT TGCGCTCATC TTCTTGCTGC CTCATTCTGC AGCAGCGGCG 4 20 

GCA AAT CTT AAT GGG ACG CTG ATG CAG TAT TTT GAA TGG TAC ATG CCC 4 68 

AAT GAC GGC CAA CAT TGG AGG CGT TTG CAA AAC GAC TCG GCA TAT TTG 516 

GCT GAA CAC GGT ATT ACT GCC GTC TGG ATT CCC CCG GCA TAT AAG GGA 564 

ACG AGC CAA GCG GAT GTG GGC TAC GGT GCT TAC GAC CTT TAT GAT TTA 612 

15 GGG GAG TTT CAT CAA AAA GGG ACG GTT CGG ACA AAG TAC GGC ACA AAA 660 

GGA GAG CTG CAA TCT GCG ATC AAA AGT CTT CAT TCC CGC GAC ATT AAC 7 08 

GTT TAC GGG GAT GTG GTC ATC AAC CAC AAA GGC GGC GCT GAT GCG ACC 756 

20 

GAA GAT GTA ACC GCG GTT GAA GTC GAT CCC GCT GAC CGC AAC CGC GTA 804 

ATT TCA GGA GAA CAC CTA ATT AAA GCC TGG ACA CAT TTT CAT TTT CCG 852 

25 GGG CGC GGC AGC ACA TAC AGC GAT TTT AAA TGG CAT TGG TAC CAT TTT 900 

GAC GGA ACC GAT TGG GAC GAG TCC CGA AAG CTG AAC CGC ATC TAT AAG 94 8 
TTT CAA GGA AAG GCT TGG GAT TGG GAA GTT TCC AAT GAA AAC GGC AAC 996 

30 

TAT GAT TAT TTG ATG TAT GCC GAC ATC GAT TAT GAC CAT CCT GAT GTC 104 4 

GCA GCA GAA ATT AAG AGA TGG GGC ACT TGG TAT GCC AAT GAA CTG CAA 10 92 

35 TTG GAC GGT TTC CGT CTT GAT GCT GTC AAA CAC ATT AAA TTT TCT TTT 114 0 

TTG CGG GAT TGG GTT AAT CAT GTC AGG GAA AAA ACG GGG AAG GAA ATG 118 8 

TTT ACG GTA GCT GAA TAT TGG CAG AAT GAC TTG GGC GCG CTG GAA AAC 1236 

40 

TAT TTG AAC AAA ACA AAT TTT AAT CAT TCA GTG TTT GAC GTG CCG CTT 128 4 

CAT TAT CAG TTC CAT GCT GCA TCG ACA CAG GGA GGC GGC TAT GAT ATG 1332 

45 AGG AAA TTG CTG AAC GGT ACG GTC GTT TCC AAG CAT CCG TTG AAA TCG 1380 

GTT ACA TTT GTC GAT AAC CAT GAT ACA CAG CCG GGG CAA TCG CTT GAG 14 28 

TCG ACT GTC CAA ACA TGG TTT AAG CCG CTT GCT TAC GCT TTT ATT CTC 14 7 6 

50 

ACA AGG GAA TCT GGA TAC CCT CAG GTT TTC TAC GGG GAT ATG TAC GGG 1524 

ACG AAA GGA GAC TCC CAG CGC GAA ATT CCT GCC TTG AAA CAC AAA ATT 157 2 

55 GAA CCG ATC TTA AAA GCG AGA AAA CAG TAT GCG TAC GGA GCA CAG CAT 1620 

GAT TAT TTC GAC CAC CAT GAC ATT GTC GGC TGG ACA AGG GAA GGC GAC 1668 
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AGC TCG GTT GCA AAT TCA GGT TTG GCG GCA TTA ATA ACA GAC GGA CCC 1716 

GGT GGG GCA AAG CGA ATG TAT GTC GGC CGG CAA AAC GCC GGT GAG ACA 17 64 

5 TGG CAT GAC ATT ACC GGA AAC CGT TCG GAG CCG GTT GTC ATC AAT TCG 1812 

GAA GGC TGG GGA GAG TTT CAC GTA AAC GGC GGG TCG GTT TCA ATT TAT 18 60 

GTT CAA AGA TAG AAGAGCAGAG AGGACGGATT TCCTGAAGGA AATCCGTTTT 1912 

10 

TTTATTTT 1920 



(2) INFORMATION FOR SEQ ID NO: 12: 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2084 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 343. . 1794 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

25 

GCCCCGCACA TACGAAAAGA CTGGCTGAAA ACATTGAGCC TTTGATGACT GAT GAT T TGG 60 
CTGAAGAAGT GGATCGATTG TTTGAGAAAA G AAG AAG AC C ATAAAAATAC CTTGTCTGTC 120 
30 ATCAGACAGG GTATTTTTTA TGCTGTCCAG ACTGTCCGCT GTGTAAAAAT AAGGAATAAA 18 0 

GGGGGGTTGT TATTATTTTA CTGATATGTA AAATAT AAT T TGTATAAGAA AAT GAGAGGG 24 0 

AGAGGAAACA TGATTCAAAA ACGAAAGCGG ACAGTTTCGT TCAGACTTGT GCTTATGTGC 300 

35 

ACGCTGTTAT TTGTCAGTTT G CCG ATT AC A AAAAC AT C AG CC GTA AAT GGC ACG 35 4 

CTG ATG CAG TAT TTT GAA TGG TAT ACG CCG AAC GAC GGC CAG CAT TGG 4 02 

40 AAA CGA TTG CAG AAT GAT GCG GAA CAT TTA TCG GAT ATC GGA ATC ACT 4 50 

GCC GTC TGG ATT CCT CCC GCA TAC AAA GGA TTG AGC CAA TCC GAT AAC 4 98 

GGA TAC GGA CCT TAT GAT TTG TAT GAT TTA GGA GAA TTC CAG CAA AAA 54 6 

45 

GGG ACG GTC AGA ACG AAA TAC GGC ACA AAA TCA GAG CTT CAA GAT GCG 5 94 

ATC GGC TCA CTG CAT TCC CGG AAC GTC CAA GTA TAC GGA GAT GTG GTT 64 2 

50 TTG AAT CAT AAG GCT GGT GCT GAT GCA ACA GAA GAT GTA ACT GCC GTC 690 
GAA GTC AAT CCG GCC AAT AGA AAT CAG GAA ACT TCG GAG GAA TAT CAA 7 38 

ATC AAA GCG TGG ACG GAT TTT CGT TTT CCG GGC CGT GGA AAC ACG TAC 78 6 

55 

AGT GAT TTT AAA TGG CAT TGG TAT CAT TTC GAC GGA GCG GAC TGG GAT 8 34 

GAA TCC CGG AAG ATC AGC CGC ATC TTT AAG TTT CGT GGG GAA GGA AAA 88 2 
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GCG TGG GAT TGG GAA GTA TCA AGT GAA AAC GGC AAC TAT GAC TAT TTA 930 
ATG TAT GCT GAT GTT GAC TAC GAC CAC CCT GAT GTC GTG GCA GAG ACA 97 8 

5 

AAA AAA TGG GGT ATC TGG TAT GCG AAT GAA CTG TCA TTA GAC GGC TTC 102 6 

CGT ATT GAT GCC GCC AAA CAT ATT AAA TTT TCA TTT CTG CGT GAT TGG 107 4 

10 GTT CAG GCG GTC AGA CAG GCG ACG GGA AAA GAA ATG TTT ACG GTT GCG 1122 
GAG TAT TGG CAG AAT AAT GCC GGG AAA CTC GAA AAC TAC TTG AAT AAA 117 0 

ACA AGC TTT AAT CAA TCC GTG TTT GAT GTT CCG CTT CAT TTC AAT TTA 1218 

15 

CAG GCG GCT TCC TCA CAA GGA GGC GGA TAT GAT ATG AGG CGT TTG CTG 1266 
GAC GGT ACC GTT GTG TCC AGG CAT CCG GAA AAG GCG GTT ACA TTT GTT 1314 
20 GAA AAT CAT GAC ACA CAG CCG GGA CAG TCA TTG GAA TCG ACA GTC CAA 13 62 

ACT TGG TTT AAA CCG CTT GCA TAC GCC TTT ATT TTG ACA AGA GAA TCC 1410 
GGT TAT CCT CAG GTG TTC TAT GGG GAT ATG TAC GGG ACA AAA GGG ACA 14 58 

25 

TCG CCA AAG GAA ATT CCC TCA CTG AAA GAT AAT ATA GAG CCG ATT TTA 1506 
AAA GCG CGT AAG GAG TAC GCA TAC GGG CCC CAG CAC GAT TAT ATT GAC 1S54 
30 CAC CCG GAT GTG ATC GGA TGG ACG AGG GAA GGT GAC AGC TCC GCC GCC 1602 
AAA TCA GGT TTG GCC GCT TTA ATC ACG GAC GGA CCC GGC GGA TCA AAG 1650 
CGG ATG TAT GCC GGC CTG AAA AAT GCC GGC GAG ACA TGG TAT GAC ATA 1698 

35 

ACG GGC AAC CGT TCA GAT ACT GTA AAA ATC GGA TCT GAC GGC TGG GGA 17 4 6 

GAG TTT CAT GTA AAC GAT GGG TCC GTC TCC ATT TAT GTT CAG AAA TAA 17 94 

40 GGTAATAAAA AAACACCTCC AAGCTGAGTG C GGG TAT CAG CTTGGAGGTG CGTTTATTTT 18 5 4 

TTCAGCCGTA TGACAAGGTC GGCATCAGGT GTGACAAATA CGGTATGCTG GCTGTCATAG 1914 
GTGACAAATC CGGGTTTTGC GCCGTTTGGC TTTTTCACAT GTCTGATTTT TGTATAATCA 197 4 

45 

ACAGGCACGG AGCCGGAATC TTTCGCCTTG GAAAAATAAG CGGCGATCGT AGCTGCTTCC 2034 
AAT AT GG ATT GTTCATCGGG ATCGCTGCTT TTAATCACAA CGTGGGATCC 208 4 

50 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
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CAT C AT AAT G GAACAAATGG T AC TAT GAT G CAATATTTCG AATGGTATTT GCCAAATGAC 60 

GGGAATCATT GGAACAGGTT GAGGGATGAC GCAGCTAACT TAAAGAGTAA AGGGATAACA 12 0 

5 

GCTGTATGGA TCCCACCTGC ATGGAAGGGG ACTTCCCAGA AT GAT G TAG G TTATGGAGCC 180 

TATGATTTAT ATGATCTTGG AGAGTTTAAC CAGAAGGGGA CGGTTCGTAC AAAAT AT GGA 24 0 

10 ACACGCAACC AGCTACAGGC TGCGGTGACC TCTTTAAAAA ATAACGGCAT TCAGGTATAT 300 

GGTGATGTCG T CAT G AAT C A TAAAGGTGGA GCAGATGGTA CGGAAATTGT AAATGCGGTA 3 60 

GAAGTGAATC GGAGCAACCG AAACCAGGAA ACCTCAGGAG AGTATGCAAT AGAAGCGTGG 4 20 

15 

ACAAAGTTTG ATTTTCCTGG AAGAGGAAAT AACCATTCCA GCTTTAAGTG GCGCTGGTAT 4 80 

CATTTTGATG GGACAGATTG GGATCAGTCA CGCCAGCTTC AAAACAAAAT ATATAAATTC 54 0 

20 AGGGGAACAG GCAAGGCCTG GGACTGGGAA GTCGATACAG AG AAT G G C AA C TAT G AC TAT 600 

CTTATGTATG CAGACGTGGA TAT G GAT C AC CCAGAAGTAA TACATGAACT TAGAAACTGG 6 60 

GGAGTGTGGT ATACGAATAC ACTGAACCTT GATGGATTTA GAATAGATGC AG T G AAAC AT 720 

25 

ATAAAATATA GCTTTACGAG AGATTGGCTT ACACATGTGC GTAACACCAC AGGTAAACCA 780 

ATGTTTGCAG TGGCTGAGTT TTGGAAAAAT GACCTTGGTG CAATTGAAAA CTATTTGAAT 84 0 

30 AAAACAAGTT GGAATCACTC GGTGTTTGAT GTTCCTCTCC ACTATAATTT GTACAATGCA 900 

TCTAATAGCG GTGGTTATTA T GAT AT GAGA AAT AT T T T AA ATGGTTCTGT GGTGCAAAAA 9 60 

CATCCAACAC ATGCCGTTAC TTTTGTTGAT AACCATGATT CTCAGCCCGG GGAAGCATTG 1020 

35 

GAATCCTTTG TTCAACAATG GTTTAAACCA CTTGCATATG CATTGGTTCT GACAAGGGAA 1080 

CAAGGTTATC CTTCCGTATT TTATGGGGAT TACTACGGTA TCCCAACCCA TGGTGTTCCG 114 0 

40 GCTATGAAAT CT AAAAT AG A CCCTCTTCTG CAGGCACGTC AAACTTTTGC CTATGGTACG 1200 

CAGCATGATT ACTTTGATCA TCATGATATT ATCGGTTGGA CAAGAGAGGG AAATAGCTCC 12 60 

CATCCAAATT CAGGCCTTGC CACCATTATG TCAGATGGTC CAGGTGGTAA CAAATGGATG 1320 

45 

TATGTGGGGA AAAAT AAAGC GGGACAAGTT TGGAGAGATA TTACCGGAAA TAGGACAGGC 138 0 

ACCGTCACAA TTAATGCAGA CGGATGGGGT AATTTCTCTG TTAATGGAGG GTCCGTTTCG 14 4 0 

50 GTTTGGGTGA AGCAA 14 55 

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 
55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CATCATAATG GGACAAATGG GACGATGATG CAATACTTTG AATGGCACTT GCCTAATGAT 60 
GGGAATCACT GGAATAGATT AAG AG AT GAT GCTAGTAATC TAAGAAATAG AGGTATAACC 120 
GCTATTTGGA TTCCGCCTGC CTGGAAAGGG ACTTCGCAAA ATGATGT GGG GTATGGAGCC 18 0 

TATGATCTTT ATGATTTAGG GGAATTTAAT CAAAAGGGGA CGGTTCGTAC TAAGTATGGG 24 0 
ACACGTAGTC AATTGGAGTC TGCCATCCAT GCTTTAAAGA ATAATGGCGT TCAAGTTTAT 300 
GGGGATGTAG TGATGAACCA TAAAGGAGGA GCTGATGCTA CAGAAAACGT TCTTGCTGTC 360 
15 GAGGTGAATC CAAATAACCG G AAT C AAG AA ATATCTGGGG ACTACACAAT TGAGGCTTGG 42 0 

ACTAAGTTTG ATTTTCCAGG GAGGGGTAAT ACATACTCAG ACTTTAAATG GCGTTGGTAT 4 80 

CATTTCGATG GTGTAGATTG GGATCAATCA CGACAATTCC AAAATCGTAT CTACAAATTC 54 0 

20 

CGAGGTGATG G T AAG G CAT G GGATTGGGAA GTAGATTCGG AAAATGGAAA T TAT GAT TAT 600 
TTAATGTATG CAGATGTAGA TAT G GAT CAT CCGGAGGTAG TAAATGAGCT TAGAAGATGG 660 
25 GGAGAATGGT ATACAAATAC ATTAAATCTT GATGGATTTA GGATCGATGC GGTGAAGCAT 720 
ATTAAATATA GCTTTACACG TGATTGGTTG ACCCATGTAA GAAACGCAAC GGGAAAAGAA 7 80 

ATGTTTGCTG TTGCTGAATT TTGGAAAAAT GATTTAGGTG CCTTGGAGAA CTATTTAAAT 84 0 

30 

AAAACAAACT GGAATCATTC TGTCTTTGAT GTCCCCCTTC ATTATAATCT TTATAACGCG 900 
TCAAATAGTG GAGGCAACTA TGACATGGCA AAACTTCTTA ATGGAACGGT TGTTCAAAAG 960 

35 CATCCAATGC ATGCCGTAAC TTTTGTGGAT AAT C AC GAT T CTCAACCTGG GGAATCATTA 1020 

GAATCATTTG T AC AAG AAT G GTTTAAGCCA CTTGCTTATG CGCTTATTTT AACAAGAGAA 108 0 

CAAGGCTATC CCTCTGTCTT CTATGGTGAC TACTATGGAA TTCCAACACA TAGTGTCCCA 114 0 

40 

GCAATGAAAG CCAAGATTGA TCCAATCTTA GAGGCGCGTC AAAATTTTGC AT AT G G AAC A 1200 

CAACATGATT ATTTTGACCA T CAT AAT AT A ATCGGATGGA CACGTGAAGG AAATACCACG 12 60 

45 CATCCCAATT CAGGACTTGC GACTATCATG TCGGATGGGC CAGGGGGAGA GAAATGGATG 1320 

TACGTAGGGC AAAATAAAGC AGGTCAAGTT TGGCATGACA TAACTGGAAA TAAACCAGGA 138 0 

AC AG T T AC G A TCAATGCAGA TGGATGGGCT AATTTTTCAG TAAATGGAGG ATCTGTTTCC 14 4 0 

ATTTGGGTGA AACGA 14 55 



50 



(2) INFORMATION FOR SEQ ID NO: 15: 
55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Forward Primer FSA" 
(ix) FEATURE: 

(A) NAME /KEY : misc-f eature 
(B) LOCATION: 22-27,2 9,31-33,41 
(D) : OTHER INFORMATION: 



/Note= 


1: 35% A, 65% 


C 


2 : 


8 3 % G , 


17% A 




3: 


63% G, 


37% T 




4 : 


86% G, 


14% A 




5: 


85% G, 


15% C 




6: 


50% T, 


50% C 




7 : 


95% A, 


5%G 




8 : 


58% G, 


37% A, 


5% 


9: 


86% C, 


13% A, 


1% 


10 


: 83% T, 


17% G 




11 


: 92% G, 


8% C 





T 
G 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

caaaatcgta tctacaaatt cl23456a7g 8910tgggatt 
llggaagtaga ttcggaaaat 

60 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc-f eature : 

(B) OTHER INFORMATION: /desc = "Reverse Primer RSA" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

gaatttgtag atacgatttt g 

21 



(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer Bl" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGATTGCTGA CGCTGTTATT TGCG 24 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
5 (ix) FEATURE: 

(A) NAME /KEY : misc-feature: 

(B) OTHER INFORMATION: /desc = "Primer Y2" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

10 CTTGTTCCCT TGTCAGAACC AATG 24 

(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
20 (A) NAME/KEY: misc-feature: 

(B) OTHER INFORMATION: /desc - "Primer 101458" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



25 



GTCATAGTTG CCGAAATCTG TATCGACTTC 30 



(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc-feature: 
35 ( B ) OTHER INFORMATION: /desc = "Primer 101638" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CCCAGTCCCA CGTACGTCCC C T GAAT T TAT AT A TTTTG 35 

40 (2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-feature: 

(B) OTHER INFORMATION: /desc = "Oligo 1" 
50 (A) NAME /KEY : misc-feature 

(B) LOCATION: 12 

(D) : OTHER INFORMATION: /Note=N= 25% A, 25% C, 25% G, 25% T. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

55 CCCAGTCCCA GNTCTTTCCC CTGAATTTAT ATATTTTG 38 

(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer X2" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GCGTGGACAA AGTTTGATTT TCCTG 25 



(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc-f eature : 

(B) OTHER INFORMATION: /desc - "Primer DA01" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc- feature : 

(B) OTHER INFORMATION: /desc - "Primer DA03" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

GCATTGGATG CTTTTGAACA ACCG 2 4 

2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA07" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

C G C AAAAT G A TATCGGGTAT GGAGCC 2 6 

(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
5 (A) NAME/ KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA20" 
(ix) FEATURE: 

(A) NAME /KEY : misc-f eature 

(B) LOCATION: 13,14 

10 (D) : OTHER INFORMATION: /Note:S= mixture of C and G 

W= mixture of A and T 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GTGATGAACC ACSWAGGTGG AGCTGATGC 2 9 

15 (2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY: misc- feature : 

(B) OTHER INFORMATION: /desc = "Primer DA14" 
25 (ix) FEATURE: 

(A) NAME/KEY: misc-f eature 

(B) LOCATION: 13,14 

(D) : OTHER INFORMATION: /Note:R= mixture of A and G 

Y= mixture of C and T 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GATGGTGTAT. GGRYCAATCA CGACAATTCC 30 

2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
40 (ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA15" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

GGTGTATGGG ATAACTCACG ACAATTCC 28 

45 

2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 
55 (B) OTHER INFORMATION: /desc = "Primer DA16" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GGTGTATGGG ATCTCTCACG ACAATTCC 2 8 

2) INFORMATION FOR SEQ ID NO: 30: 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc - "Primer DA17" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
15 GGGATCAATC ACGAAATTTC CAAAATCGTA TC 32 



2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
25 (A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA18" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GGGATCAATC ACGACTCTTC CAAAATCGTA TC 32 

30 2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA06" 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

GGAAATTATG ATTATATCAT GTATGCAGAT GTAG 34 

2) INFORMATION FOR SEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
50 (ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA09" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GCTGAATTTT GGTCGAATGA TTTAGGTGCC 30 
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2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
10 (A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA11" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

GCTGAATTTT GGTCGAATGA TTTAGGTGCC 30 

15 2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
20 ( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA21" 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GAATTTTGGA AGTACGATTT AGGTCGG 27 

(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
35 (ix) FEATURE: 

(A) NAME/ KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA12" 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature 
40 (B) LOCATION: 12,13 

(D) : OTHER INFORMATION: /Note : R= mixture of A and G 

Y= mixture of C and T 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



45 



GGAAAAACGA TRYCGGTGCC TTGGAGAAC 2 9 



(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY: misc-f eature : 
55 (B) OTHER INFORMATION: /desc = "Primer DA13" 
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(ix) FEATURE: 

(A) NAME /KEY : misc-f eature 

(B) LOCATION: 14,15 

(D) : OTHER INFORMATION: /Note:R= mixture of A and G 

Y— mixture of C and T 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 

GATTTAGGTG CCTRYCAGAA CTATTTA 

2) INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA08' 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CCCCCTTCAT GAGAATCTTT ATAACG 

2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA04 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GAATCCGAAC CTCATTACAC ATTCG 

2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA05 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CGGATGGACT CGAGAAGGAA ATACCACG 

2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
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(A) NAME /KEY : misc- feature : 

(B) OTHER INFORMATION: /desc = "Primer DA10" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CGTAGGGCAA AATCAGGCCG GTCAAGTTTG G 3 

2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA22" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

CATAACTGGA AATCGCCCGG GAACAGTTAC G 3 



(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/ KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA19" 

(ix) FEATURE: 

(A) NAME/KEY: misc-f eature 

(B) LOCATION: 12 

(D) : OTHER INFORMATION: /Note:W= mixture of A and T 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 3 

CTGGAAATAA AWCCGGAACA GTTACG 

2) INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA23" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GGAAATAAAC CAGGACCCGT TACGATCAAT GC 



2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
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(A) NAME /KEY : misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer DA32" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

GAGGCTTGGA CTAGGTTTGA TTTTCCAG 2 8 

5 

2) INFORMATION FOR SEQ ID NO: 46: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 
15 (B) OTHER INFORMATION: /desc = "Primer DA31" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

GCTGAATTTT GGCGCAATGA TTTAGGTGCC 30 



2) INFORMATION FOR SEQ ID NO: 47: 
20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer bm4 " 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
30 GTGTTTGACG TCCCGCTTCA TGAGAATTTA CAGG 34 



2) INFORMATION FOR SEQ ID NO: 48: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
40 (A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer bm5" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

GTGTTTGACG TCCCGCTTCA T AAGAAT T T A CAGG 34 



45 2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer bm6" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

GTGTTTGACG TCCCGCTTCA TGCCAATTTA CAGG 34 



2) INFORMATION FOR SEQ ID NO: 50: 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME /KEY : misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer bm8" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
15 AGGGAATCCG GATACCCTGA GGTTTTCTAC GG 32 



2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
20 ( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 
25 (A) NAME /KEY : misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer bmll" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GATGTGGTTT TGGATCATAA GGCCGGCGCT GATG 34 



30 2) INFORMATION FOR SEQ ID NO: 52 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer pi" 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

CTGTTATTAA TGCCGCCAAA CC 22 



2) INFORMATION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH : 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
50 (ix) FEATURE: 

(A) NAME/KEY: misc- feature : 

(B) OTHER INFORMATION: /desc = "Primer p2" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



WO 99/23211 



PCT/DK98/00471 



34 

GGAAAAGAAA TGTTTACGGT TGCG 2 4 



2) INFORMATION FOR SEQ ID NO: 54: 
(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
10 (ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 

(B) OTHER INFORMATION: /desc = "Primer p3" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

GAAATGAAGC GGAACAT CAA ACACG 25 

15 

2) INFORMATION FOR SEQ ID NO: 55: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc-f eature : 
25 (B) OTHER INFORMATION: /desc = "Primer p4" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

GTATGATTTA GGAGAATTCC 20 
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